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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 




(i) APPLICANT: Hilton, Douglas J. 

Alexander, Warren s. 
Viney, Elizabeth M. 
Willson, Tracy A. 
Richardson, Rachae/ T, 
Starr, Robyn 
Nicholson, Sandr^f E. 
Metcalf, Donald j 
Nicola, Nicos 



(ii) TITLE OF INVENTION: 
(iii) NUMBER OF SEQUENCES: 5 




PEUTIC AND DIAGNOSTIC AGENTS 



(iv) CORRESPONDENCE ADDRESS 

(A) ADDRESSEE: Scully, Scott, Murphy & Presser 

(B) STREET: 400 Gafcden City Plaza 

(C) CITY: Garden City, New York 

(D) STATE: New York 

(E) COUNTRY: U.S'.A. 

(F) ZIP: 11530 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYKE: Floppy disk 

(B) COMPUTER: /IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE/ Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/962,560 

(B) FILING DATE: 31-OCT-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/AU97/ 00729 

(B) FILING DATE: 31-OCT-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: P05117 

(B) FILING DATE: 14-FEB-1997 

(vii) PRIOR/ APPLICATION DATA: 

(A) /APPLICATION NUMBER: PO 3384 

(B) / FILING DATE: 01-NOV-1996 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: DiGiglio, Frank S. 

(B) REGISTRATION NO: 31,346 
f(C) REFERENCE/DOCKET NUMBER: 10976 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (516) 742-4343 

(B) TELEFAX: (516) 742-4366 
I (C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: S^Q ID NO:l: 
CACGCCGCCC ACGTGAAGGC 
(2) INFORMATION FOR SEQ ID NO/2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 basse pairs 

(B) TYPE: nucleia acid 

(C) STRANDEDNESS/ single 

(D) TOPOLOGY: l/near 

(ii) MOLECULE TYPE: JDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
TTCGCCAATG ACAAGACGCT, 
(2) INFORMATION FOR £EQ ID NO: 3 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1235 base pairs 

(B) TYPET: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) /NAME/KEY : CDS 
(BY LOCATION: 161. .799 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CGAGGCTCA# GCTCCGGGCG GATTCTGCGT GCCGCTCTCG CTCCTTGGGG TCTGTTGGCC 



GGCCTGTGCC ACCCGGACGC CCGGCTCACT GCCTCTGTCT CCCCCATCAG CGCAGCCCCG 
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GACGCTATGG CCCACCCCTC CAGCTGGCCC CTCGAGTAGG 

ATG GTA GCA CGC AAC CAG GTG GCA GCC GAC AAT G&G ATC TCC CCG GCA 
Met Val Ala Arg Asn Gin Val Ala Ala Asp Asn Kla. lie Ser Pro Ala 
1 5 10/15 

GCA GAG CCC CGA CGG CGG TCA GAG CCC TCC TCG TCC TCG TCT TCG TCC 
Ala Glu Pro Arg Arg Arg Ser Glu Pro Ser Ser Ser Ser Ser Ser Ser 
20 25 / 30 

TCG CCA GCG GCC CCC GTG CGT CCC CGG CCtf TGC CCG GCG GTC CCA GCC 
Ser Pro Ala Ala Pro Val Arg Pro Arg Ptfo Cys Pro Ala Val Pro Ala 
35 40 / 45 

CCA GCC CCT GGC GAC ACT CAC TTC CGC/ACC TTC CGC TCC CAC TCC GAT 
Pro Ala Pro Gly Asp Thr His Phe Arcf Thr Phe Arg Ser His Ser Asp 
50 55 / 60 

TAC CGG CGC ATC ACG CGG ACC AGC £CG CTC CTG GAC GCC TGC GGC TTC 
Tyr Arg Arg lie Thr Arg Thr Ser/Ala Leu Leu Asp Ala Cys Gly Phe 
65 70 / 75 80 



TAT TGG GGA CCC CTG AGC GTG 
Tyr Trp Gly Pro Leu Ser Val 

85 

GAG CCC GTG GGC ACC TTC TTC. 
Glu Pro Val Gly Thr Phe L« 
100 



IC GGG GCG CAC GAG CGG CTG CGT GCC 
.s Gly Ala His Glu Arg Leu Arg Ala 
90 95 

GTG CGC GAC AGT CGT CAA CGG AAC TGC 
Val Arg Asp Ser Arg Gin Arg Asn Cys 
105 110 



TTC TTC GCG CTC AGC GTG AAG ATG GCT TCG GGC CCC ACG AGC ATC CGC 
Phe Phe Ala Leu Ser Val/Lys Met Ala Ser Gly Pro Thr Ser lie Arg 
115 / 120 125 



GTG CAC TTC CAG GCC 

Val His Phe Gin Ala 
130 

TTC GAC TGC CTT TTC 

Phe Asp Cys Leu PI 
145 



SC CGC TTC CAC TTG GAC GGC AGC CGC GAG ACC 
Ly Arg Phe His Leu Asp Gly Ser Arg Glu Thr 
135 140 

GAG CTG CTG GAG CAC TAC GTG GCG GCG CCG CGC 
Glu Leu Leu Glu His Tyr Val Ala Ala Pro Arg 
150 155 160 



CGC ATG TTG GGG £CC CCG CTG CGC CAG CGC CGC GTG CGG CCG CTG CAG 
Arg Met Leu Gly/Ala Pro Leu Arg Gin Arg Arg Val Arg Pro Leu Gin 

165 170 175 

GAG CTG TGT C^C CAG CGC ATC GTG GCC GCC GTG GGT CGC GAG AAC CTG 
Glu Leu Cys AVg Gin Arg lie Val Ala Ala Val Gly Arg Glu Asn Leu 
'.80 185 190 



160 
208 

256 

304 

352 

400 

448 

496 

544 

592 

640 

688 

736 



• 
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GCG CGC ATC CCT CTT AAC CCG GTA CTC CGT GAC TAC/CTG AGT TCC TTC 
Ala Arg lie Pro Leu Asn Pro Val Leu Arg Asp Tyy Leu Ser Ser Phe 
195 200 / 205 

CCC TTC CAG ATC TGA CCGGCTG CCGCTGTGCC GCAGC/tTAA GTGGGGGCGC 
Pro Phe Gin lie 
210 

CTTATTATTT CTTATTATTA ATTATTATTA TTTTTCTGGA ACCACGTGGG AGCCCTCCCC 
GCCTGGGTCG GAGGGAGTGG TTGTGGAGGG TGAGAT/GCCT CCCACTTCTG GCTGGAGACC 
TCATCCCACC TCTCAGGGGT GGGGGTGCTC CCCT0CTGGT GCTCCCTCCG GGTCCCCCCT 
GGTTGTAGCA GCTTGTGTCT GGGGCCAGGA CCTGAATTCC ACTCCTACCT CTCCATGTTT 
ACATATTCCC AGTATCTTTG CACAAACCAG GpSTCGGGGA GGGTCTCTGG CTTCATTTTT 
CTGCTGTGCA GAATATCCTA TTTTATATTT /TTACAGCCAG TTTAGGTAAT AAACTTTATT 
ATGAAAGTTT TTTTTTAAAA GAAAAAAAAA AAAAAAAAA 
(2) INFORMATION FOR SEQ ID NO: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212/ amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: /linear 

(ii) MOLECULE TYPE :/ protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Val Ala Arg Asn Gin Val Ala Ala Asp Asn Ala lie Ser Pro Ala 
1 5 / 10 15 

Ala Glu Pro Arg Arg Arg Ser Glu Pro Ser Ser Ser Ser Ser Ser Ser 
20 / 25 30 

Ser Pro Ala Ala P^o Val Arg Pro Arg Pro Cys Pro Ala Val Pro Ala 
35 / 40 45 

Pro Ala Pro Gly/Asp Thr His Phe Arg Thr Phe Arg Ser His Ser Asp 
50 / 55 60 

Tyr Arg Arg lAe Thr Arg Thr Ser Ala Leu Leu Asp Ala Cys Gly Phe 
65 7 70 75 80 

Tvr Trp Gly Pro Leu Ser Val His Gly Ala His Glu Arg Leu Arg Ala 

85 90 95 



784 

836 

896 
956 
1016 
1076 
1136 
1196 
1235 
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Glu Pro Val Gly Thr Phe Leu Val Arg Asp Ser Arg /Gin Arg Asn Cys 
100 105 / 110 

Phe Phe Ala Leu Ser Val Lys Met Ala Ser Gly Pro Thr Ser lie Arg 

115 120 / 125 

i / 

Val His Phe Gin Ala Gly Arg Phe His Leu Asp Gly Ser Arg Glu Thr 
130 135 / 140 

Phe Asp Cys Leu Phe Glu Leu Leu Glu His /Tyr Val Ala Ala Pro Arg 
145 150 / 155 160 

Arg Met Leu Gly Ala Pro Leu Arg Gin Arg Arg Val Arg Pro Leu Gin 

165 170 175 

Glu Leu Cys Arg Gin Arg lie Val Alsf Ala Val Gly Arg Glu Asn Leu 
180 195 190 

Ala Arg lie Pro Leu Asn Pro Val Zeu Arg Asp Tyr Leu Ser Ser Phe 
195 200/ 205 

Pro Phe Gin lie / 
210 / 



(2) INFORMATION FOR SEQ ID /NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 112/1 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY^ linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: / 

(A) NAME /KEY: CDS 

(B) LOCATION: 223.. 819 

(xi) SEQUENCE /DESCRIPTION: SEQ ID NO: 5: 

GCGATCTGTG GGTGACAGTG TCTGCGAGAG ACTTTGCCAC ACCATTCTGC CGGAATTTGG 60 

AGAAAAAGAA CCAGCCGCTT CCAGTCCCCT CCCCCTCCGC CACCATTTCG GACACCCTGC 120 

ACACTCTCGT TTTGGGGTAC CCTGTGACTT CCAGGCAGCA CGCGAGGTCC ACTGGCCCCA 180 

GCTCGGGCGA CGeAGCTGTCT GGGACGTGTT GACTCATCTC CC ATG ACC CTG CGG 234 
/ Met Thr Leu Arg 

/ 1 
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TGC CTG GAG CCC TCC GGG AAT GGA GCG GAC AGG 
Cys Leu Glu Pro Ser Gly Asn Gly Ala Asp Arg 
5 10 15 





CGG AGC CAG TGG 
r Arg Ser Gin Trp 
20 



GGG ACC GCG GGG TTG CCG GAG GAA CAG TCC CQC GAG GCG GCG CGT CTG 
Gly Thr Ala Gly Leu Pro Glu Glu Gin Ser Bro Glu Ala Ala Arg Leu 

25 30 / 35 



GCG AAA GCC CTG CGC GAG CTC AGT CAA ACA GGA TGG TAC TGG GGA AGT 
Ala Lys Ala Leu Arg Glu Leu Ser Gin Thr Gly Trp Tyr Trp Gly Ser 
40 45 / 50 



282 



330 



378 



ATG ACT GTT AAT GAA GCC AAA GAG 
Met Thr Val Asn Glu Ala Lys Glu 
55 60 



TTA AAA GAG GCT CCA GAA GGA 
Leu Lys Glu Ala Pro Glu Gly 
65 



426 



ACT TTC TTG ATT AGA GAT AGT TCG/CAT TCA GAC TAC CTA CTA ACT ATA 
Thr Phe Leu lie Arg Asp Ser Ser His Ser Asp Tyr Leu Leu Thr lie 
70 75 / 80 



474 



TCC GTT AAG ACG TCA GCT GGA /CCG ACT AAC CTG CGG ATT GAG TAC CAA 
Ser Val Lys Thr Ser Ala Gly/ Pro Thr Asn Leu Arg lie Glu Tyr Gin 
85 90 / 95 100 



522 



GAT GGG AAA TTC AGA TTG GAT TCT ATC ATA TGT GTC AAG TCC AAG CTT 
Asp Gly Lys Phe Arg Leu Asp Ser lie lie Cys Val Lys Ser Lys Leu 
105 / 110 115 



570 



AAA CAG TTT GAC AGT GT/G GTT CAT CTG ATT GAC TAC TAT GTC CAG ATG 
Lys Gin Phe Asp Ser Val Val His Leu lie Asp Tyr Tyr Val Gin Met 
120 / 125 130 



618 



TGC AAG GAT AAA CGG/ ACA GGC CCA GAA GCC CCA CGG AAT GGG ACT GTT 
Cys Lys Asp Lys Arg Thr Gly Pro Glu Ala Pro Arg Asn Gly Thr Val 
135 / 140 145 



666 



CAC CTG TAC CTG ACC AAA CCT CTG TAT ACA TCA GCA CCC ACT CTG CAG 
His Leu Tyr Leu Thr Lys Pro Leu Tyr Thr Ser Ala Pro Thr Leu Gin 
150 / 155 160 



714 



CAT TTC TGT CGK CTC GCC ATT AAC AAA TGT ACC GGT ACG ATC TGG GGA 
His Phe Cys ArA Leu Ala lie Asn Lys Cys Thr Gly Thr lie Trp Gly 
165 / 170 175 180 



762 



CTG CCT TTA CCA ACA AGA CTA AAA GAT TAC TTG GAA GAA TAT AAA TTC 
Leu Pro Leu /Pro Thr Arg Leu Lys Asp Tyr Leu Glu Glu Tyr Lys Phe 
185 190 195 



810 



CAG GTA TA^GTATTTC TCTCTCTTTT TCGTTTTTTT TTAAAAAAAA AAAAACACAT 866 
Gin Val 

GCCTCATABA GACTATCTCC GAATGCAGCT ATGTGAAAGA GAACCCAGAG GCCCTCCTCT 926 
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GGATAACTGC GCAGAATTCT CTCTTAAGGA CAGTTGGGCT C^GTCTAACT TAAAGGTGTG 986 

AAGATGTAGC TAGGTATTTT AAAGTTCCCC TTAGGTAGTT TJTAGCTGAAT GATGCTTTCT 1046 

TTCCTATGGC TGCTCAAGAT CAAATGGCCC TTTTAAATGA/AACAAAACAA AACAAAACAA 1106 

AAAAAAAAAA AAAAA / 1121 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acidi 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SE0 ID NO: 6: 



Met Thr Leu Arg Cys Leu Glu Pro 
1 5 



Set Gly Asn Gly Ala Asp Arg Thr 
10 15 



Arg Ser Gin Trp Gly Thr Ala Gly ten Pro Glu Glu Gin Ser Pro Glu 
20 / 25 30 

Ala Ala Arg Leu Ala Lys Ala Leii Arg Glu Leu Ser Gin Thr Gly Trp 
35 m 45 



Tyr Trp Gly Ser Met Thr Val 
50 55 



Ala Pro Glu Gly Thr Phe Lei 
65 70 



jfsn Glu Ala Lys Glu Lys Leu Lys Glu 
60 

lie Arg Asp Ser Ser His Ser Asp Tyr 



75 



80 



Leu Leu Thr lie Ser Val L} 

85 

lie Glu Tyr Gin Asp Gly 
100 



■s Thr Ser Ala Gly Pro Thr Asn Leu Arg 



90 



95 



jys Phe Arg Leu Asp Ser lie lie Cys Val 



105 



110 



Lys Ser Lys Leu Lys Gl} 
115 



Phe Asp Ser Val Val His Leu lie Asp Tyr 



120 



125 



Tyr Val Gin Met Cys 
130 

Asn Gly Thr Val Hisj 
145 



rs Asp Lys Arg Thr Gly Pro Glu Ala Pro Arg 



135 



140 



Leu Tyr Leu Thr Lys Pro Leu Tyr Thr Ser Ala 



150 



155 



160 



Pro Thr Leu Gin His Phe Cys Arg Leu Ala lie Asn Lys Cys Thr Gly 



170 



175 



Thr lie Trp Gly Leu Pro Leu Pro Thr Arg Leu Lys/ Asp Tyr Leu Glu 
180 185 / 190 

Glu Tyr Lys Phe Gin Val 
195 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 18.. 695 

(xi) SEQUENCE DESCRIPTION: 



2Q ID NO: 7; 



CGCTGGCTCC GTGCGCC ATG GTC ACC/CAC AGC AAG TTT CCC GCC GCC GGG 

Met Val Thr/ His Ser Lys Phe Pro Ala Ala Gly 
1/5 10 



50 



ATG AGC CGC CCC CTG GAC ACC AGC CTG CGC CTC AAG ACC TTC AGC TCC 
Met Ser Arg Pro Leu Asp Thr/Ser Leu Arg Leu Lys Thr Phe Ser Ser 
15 / 20 25 



98 



AAA AGC GAG TAC CAG CTG 
Lys Ser Glu Tyr Gin Leu 
30 



?G GTG AAC GCC GTG CGC AAG CTG CAG GAG 
il Val Asn Ala Val Arg Lys Leu Gin Glu 
35 40 



146 



AGC GGA TTC TAC TGG AGC/GCC GTG ACC GGC GGC GAG GCG AAC CTG CTG 
Ser Gly Phe Tyr Trp Sey Ala Val Thr Gly Gly Glu Ala Asn Leu Leu 
45 / 50 55 



194 



CTC AGC GCC GAG CCC 
Leu Ser Ala Glu Pro 
60 



:G GGC ACC TTT CTT ATC CGC GAC AGC TCG GAC 
Gly Thr Phe Leu lie Arg Asp Ser Ser Asp 
65 70 75 



242 



CAG CGC CAC TTC TTG" ACG TTG AGC GTC AAG ACC CAG TCG GGG ACC AAG 
Gin Arg His Phe Php Thr Leu Ser Val Lys Thr Gin Ser Gly Thr Lys 

85 90 



290 



AAC CTA CGC ATC GAG TGT GAG GGG GGC AGC TTT TCG CTG CAG AGT GAC 
Asn Leu Arg lie pin Cys Glu Gly Gly Ser Phe Ser Leu Gin Ser Asp 
95/ 100 105 



338 
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CCC CGA AGC ACG CAG CCA GTT CCC CGC TTC GAC/TGT GTA CTC AAG CTG 
Pro Arg Ser Thr Gin Pro Val Pro Arg Phe Asp/ Cys Val Leu Lys Leu 
110 115 / 120 



386 



GTG CAC CAC TAC ATG CCG CCT CCA GGG ACC CCC TCC TTT TCT TTG CCA 
Val His His Tyr Met Pro Pro Pro Gly Thr Pro Ser Phe Ser Leu Pro 
125 130 / 135 



434 



CCC ACG GAA CCC TCG TCC GAA GTT CCG GAjB CAG CCA CCT GCC CAG GCA 
Pro Thr Glu Pro Ser Ser Glu Val Pro Glm Gin Pro Pro Ala Gin Ala 
140 145 / 150 155 



482 



CTC CCC GGG AGT ACC CCC AAG AGA GCT/tAC TAC ATC TAT TCT GGG GGC 
Leu Pro Gly Ser Thr Pro Lys Arg Ala/ Tyr Tyr lie Tyr Ser Gly Gly 
160 / 165 170 



530 



GAG AAG ATT CCG CTG GTA CTG AGC CGA CCT CTC TCC TCC AAC GTG GCC 
Glu Lys lie Pro Leu Val Leu Ser Arg Pro Leu Ser Ser Asn Val Ala 
175 /180 185 



578 



ACC CTC CAG CAT CTT TGT CGG AA<3 ACT GTC AAC GGC CAC CTG GAC TCC 
Thr Leu Gin His Leu Cys Arg hyp Thr Val Asn Gly His Leu Asp Ser 
190 195 200 



626 



TAT GAG AAA GTG ACC CAG CTG (CCT GGA CCC ATT CGG GAG TTC CTG GAT 
Tyr Glu Lys Val Thr Gin Leu /Pro Gly Pro lie Arg Glu Phe Leu Asp 
205 210/ 215 



674 



CAG TAT GAT GCT CCA CTT TAAGGAGCAA AAGGGTCAGA GGGGGGCCTG 
Gin Tyr Asp Ala Pro Leu / 
220 225 / 


722 


GGTCGGTCGG 


TCGCCTCTCC 


TOCGAGGCAC 


ATGGCACAAG 


CACAAAAATC 


CAGCCCCAAC 


782 


GGTCGGTAGC 


TCCCAGTGAG 


CCAGGGGCAG 


ATTGGCTTCT 


TCCTCAGGCC 


CTCCACTCCC 


842 


GCAGAGTAGA 


GCTGGCAGGA 


iCCTGGAATTC 


GTCTGAGGGG 


AGGGGGAGCT 


GCCACCTGCT 


902 


TTCCCCCCTC 


CCCCAGCTCQ 


AGCTTCTTTC 


AAGTGGAGCC 


AGCCGGCCTG 


GCCTGGTGGG 


962 


ACAATACCTT 


TGACAAGCGG 


ACTCTCCCCT 


CCCCTTCCTC 


CACACCCCCT 


CTGCTTCCCA 


1022 


AGGGAGGTGG 


GGACACOTCC 


AAGTGTTGAA 


CTTAGAACTG 


CAAGGGGAAT 


CTTCAAACTT 


1082 


TCCCGCTGGA 


acttgt™?gc 


GCTTTGATTT 


GGTTTGATCA 


AGAGCAGGCA 


CCTGGGGGAA 


1142 


GGATGGAAGA 


GAAAAGGGTG 


TGTGAAGGGT 


TTTTATGCTG 


GCCAAAGAAA 


TAACCACTCC 


1202 


CACTGCCCAA 


CCTAGGTGAG 


GAGTGGTGGC 


TCCTGGCTCT 


GGGGAGAGTG 


GCAAGGGGTG 


1262 


ACCTGAAGAG 


AGCTATACTG 


GTGCCAGGCT 


CCTCTCCATG 


GGGCAGCTAA 


TGAAACCTCG 


1322 




CAGATCCCTT GCACCCCAGA ACCCTCCCCG TTGTGAAGAG GCXGTAGCAT TTAGAAGGGA 
GACAGATGAG GCTGGTGAGC TGGCCGCCTT TTCCAACACC GAAGGGAGGC AGATCAACAG 
ATGAGCCATC TTGGAGCCCA GGTTTCCCCT GGAGCAGATg/gaGGGTTCTG CTTTGTCTCT 
CCTATGTGGG GCTAGGAGAC TCGCCTTAAA TGCCCTCTOT CCCAGGGATG GGGATTGGCA 
CACAAGGAGC CAAACACAGC CAATAGGCAG AGAGTTGAGG GATTCACCCA GGTGGCTACA 
GGCCAGGGGA AGTGGCTGCA GGGGAGAGAC CCAGT<2?ACTC CAGGAGACTC CTGAGTTAAC 
ACTGGGAAGA CATTGGCCAG TCCTAGTCAT CTCTCGGTCA GTAGGTCCGA GAGCTTCCAG 
GCCCTGCACA GCCCTCCTTT CTCACCTGGG GGfSAGGCAGG AGGTGATGGA GAAGCCTTCC 
CATGCCGCTC ACAGGGGCCT CACGGGAATG OAGCAGCCAT GCAATTACCT GGAACTGGTC 
CTGTGTTGGG GAGAAACAAG TTTTCTGAAG/ TCAGGTATGG GGCTGGGTGG GGCAGCTGTG 
TGTTGGGGTG GCTTTTTTCT CTCTGTTTTG AATAATGTTT ACAATTTGCC TCAATCACTT 
TTATAAAAAT CCACCTCCAG CCCGCCCOTC TCCCCACTCA GGCCTTCGAG GCTGTCTGAA 
GATGCTTGAA AAACTCAACC AAATCCCAGT TCAACTCAGA CTTTGCACAT ATATTTATAT 
TTATACTCAG AAAAGAAACA TTTCA0TAAT TTATAATAAA AGAGCACTAT TTTTTAATGA 
AAAAAAAAAA AAAAAAAAAA 
(2) INFORMATION FOR SEQ IjfD NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH* 225 amino acids 

(B) TYPE: /amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE /DESCRIPTION: SEQ ID NO: 8: 

Met Val Thr His Sey Lys Phe Pro Ala Ala Gly Met Ser Arg Pro Leu 
1/5 10 15 

Asp Thr Ser Leu Atg Leu Lys Thr Phe Ser Ser Lys Ser Glu Tyr Gin 
20 / 25 30 

Leu Val Val Asn/Ala Val Arg Lys Leu Gin Glu Ser Gly Phe Tyr Trp 
35 / 40 45 

Ser Ala Val Thr Gly Gly Glu Ala Asn Leu Leu Leu Ser Ala Glu Pro 
50 / 55 60 



1382 
1442 
1502 
1562 
1622 
1682 
1742 
1802 
1862 
1922 
1982 
2042 
2102 
2162 
2187 



Ala Gly Thr Phe Leu lie Arg Asp 
65 70 

Thr Leu Ser Val Lys Thr Gin Ser 
85 

Cys Glu Gly Gly Ser Phe Ser Leu 
100 

Pro Val Pro Arg Phe Asp Cys Val 
115 120 

Pro Pro Pro Gly Thr Pro Ser Phe 
130 135 

Ser Glu Val Pro Glu Gin Pro Pro 
145 150 

Pro Lys Arg Ala Tyr Tyr lie Tyr 
165 

Val Leu Ser Arg Pro Leu Ser Ser 
180 

Cys Arg Lys Thr Val Asn Gly Hi/s 
195 2/60 

Gin Leu Pro Gly Pro lie Arg/Glu 
210 215) 

Leu 
225 



(2) INFORMATION FOR SEQ/ID NO: 9 





Ser Ser Asp 
75 

Gly Thr Lys 
90 

Gin Ser Asp/ 
105 

Leu Lys l/eu 
Ser 

Ala jSln Ala 
155 

Gly Gly 
170 

ten Val Ala 
185 

Leu Asp Ser 
Phe Leu Asp 



Gin /Irg His Phe Phe 
80 

Aj^i Leu Arg lie Gin 
95 

Pro Arg Ser Thr Gin 
110 

Val His His Tyr Met 
125 



Pro Thr Glu Pro Ser 
140 

Leu Pro Gly Ser Thr 
160 

Glu Lys lie Pro Leu 
175 

Thr Leu Gin His Leu 
190 

Tyr Glu Lys Val Thr 
205 

Gin Tyr Asp Ala Pro 
220 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1094 base pairs 

(B) TYPE/ nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPjDLOGY: linear 

(ii) MOLECULE /TYPE: DNA 

<xi) SEQUENCp DESCRIPTION: SEQ ID NO: 9: 
CTCCGGCTGG CCCCTT^TGT AGGATGGTAG CACACAACCA GGTGGCAGCC GACAATGCAG 60 
TCTCCACAGC AGCAG^GCCC CGACGGCGGC CAGAACCTTC CTCCTCTTCC TCCTCCTCGC 120 
CCGCGGCCCC CGCGGfGCCCG CGGCCGTGCC CCGCGGTCCC GGCCCCGGCC CCCGGCGACA 180 
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CGCACTTCCG CACATTCCGT TCGCACGCCG ATTACCGGCG CATCACGCGC GCCAGCGCGC 240 

TCCTGGACGC CTGCGGATTC TACTGGGGGC CCCTGAGCGT GC^ACGGGGCG CACGAGCGGC 300 

TGCGCGCCGA GCCCGTGGGC ACCTTCCTGG TGCGCGACAG /CCGCCAGCGG AACTGCTTTT 360 

TCGCCCTTAG CGTGAAGATG GCCTCGGGAC CCACGAGCA^T CCGCGTGCAC TTTCAGGCCG 420 

GCCGCTTTCA CCTGGATGGC AGCCGCGAGA GCTTCGAQ^G CCTCTTCGAG CTGCTGGAGC 480 

ACTACGTGGC GGCGCCGCGC CGCATGCTGG GGGCCC<£GCT GCGCCAGCGC CGCGTGCGGC 540 

CGCTGCAGGA GCTGTGCCGC CAGCGCATCG TGGCQACCGT GGGCCGCGAG AACCTGGCTC 600 

GCATCCCCCT CAACCCCGTC CTCCGCGACT ACCfGAGCTC CTTCCCCTTC CAGATTTGAC 660 

CGGCAGCGCC CGCCGTGCAC GCAGCATTAA CT&GGATGCC GTGTTATTTT GTTATTACTT 720 

GCCTGGAACC ATGTGGGTAC CCTCCCCGGC OtGGGTTGGA GGGAGCGGAT GGGTGTAGGG 780 

GCGAGGCGCC TCCCGCCCTC GGCTGGAGAC ^AGGCCGCAG ACCCCTTCTC ACCTCTTGAG 840 

GGGGTCCTCC CCCTCCTGGT GCTCCCTCTG GGTCCCCCTG GTTGTTGTAG CAGCTTAACT 900 

GTATCTGGAG CCAGGACCTG AACTCGCACC TCCTACCTCT TCATGTTTAC ATATACCCAG 960 

TATCTTTGCA CAAACCAGGG GTTGGGQ^AG GGTCTCTGGC TTTATTTTTC TGCTGTGCAG 1020 

AATCCTATTT TATATTTTTT AAAGTQfAGTT TAGGTAATAA ACTTTATTAT GAAAGTTTTT 1080 

TTTTTTAAAA AAAA / 1094 
(2) INFORMATION FOR SEQ it NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21/1 amino acids 

(B) TYPE: amiAo acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY/: linear 

lii) MOLECULE TYP£: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Val Ala HLfe Asn Gin Val Ala Ala Asp Asn Ala Val Ser Thr Ala 
1/5 10 15 



Ala Glu Pro 



:g Arg Arg Pro Glu Pro Ser Ser Ser Ser Ser Ser Ser 
!0 25 30 



Pro Ala Ala/ Pro Ala Arg Pro Arg Pro Cys Pro Ala Val Pro Ala Pro 
35 / 40 45 
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Ala Pro Gly Asp Thr His Phe Arg Thr Phe Arg/ Ser His Ala Asp Tyr 
50 55 / 60 

Arg Arg lie Thr Arg Ala Ser Ala Leu Leu lisp Ala Cys Gly Phe Tyr 
65 70 /75 80 

Trp Gly Pro Leu Ser Val His Gly Ala His Glu Arg Leu Arg Ala Glu 

85 9(9 95 

Pro Val Gly Thr Phe Leu Val Arg Asp yser Arg Gin Arg Asn Cys Phe 
100 105/ 110 

Phe Ala Leu Ser Val Lys Met Ala Ser Gly Pro Thr Ser lie Arg Val 
115 120 / 125 

His Phe Gin Ala Gly Arg Phe His/Leu Asp Gly Ser Arg Glu Ser Phe 
130 135 / 140 

Asp Cys Leu Phe Glu Leu Leu Glu His Tyr Val Ala Ala Pro Arg Arg 
145 150 / 155 160 

Met Leu Gly Ala Pro Leu Arg/ Gin Arg Arg Val Arg Pro Leu Gin Glu 

165 / 170 175 

Leu Cys Arg Gin Arg lie Val Ala Thr Val Gly Arg Glu Asn Leu Ala 
180 / 185 190 

Arg lie Pro Leu Asn Pro Val Leu Arg Asp Tyr Leu Ser Ser Phe Pro 
195 / 200 205 

Phe Gin He 
210 

(2) INFORMATION FOR SEQ flD NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2807 base pairs 

(B) TYPE/ nucleic acid 

(C) STR5QSFDEDNESS : single 

(D) TOPpLOGY: linear 

(ii) MOLECULE^ TYPE : DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGAAACCGAG GCGGGGAGAC CAGGAGGCCT TGGCCTCAGA GCTTCAGAGT CGCGTGGCAG 
CAAACAGAGA AACCTGTAGA GGGCAGTGTG CGTCACTTAG CTCAGGGAAG CTGCACGCGA 
AACTCACCCG CCTTCATTCA TAAACATCGT CAGCTAGGCA CCTACTCCTG GGCTTTCAGG 
ACAAACTGAA TCACGAAACC ACAGTGTCCT TAAAATAGGT CTGACCGCCT GAATCCCTGG 



60 
120 
180 
240 



# • / 
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CCAAGGTGTG TACGGGGCAT GGGAGCCCTT GTGCAGAGAT GCTTXSCAGGA GCCTTGAGGG 300 

GCTCTGTAAG ACAGAGGCTA GGAAGACAAA GTTGGGGGCT ACAGCTTCTT GTCCTGCCCG 360 

GGGCCTCAGT TTCTTCGGTT GCCCACGTAG GAGTGCAGAG AGTCCAGCCC CTGGGGACCC 420 

AACCCAACCC CGCCCAGTTT CCGAGGAACT CGTCCGGGAG CGGGGGCGCC CCTCCCGCAC 480 

CGCCTTAGGC TTCCTTTGAA GCCTCTGCGG TCAGGCCACGT GCTTCCTGGG AAGCCCAAGC 540 

CAAGGCCAGG CCGAGTGGCC AACGGGAGGG GCCCGCGCSC GATTCTGGAG GAGGGCGGCG 600 

GCCCCACAGG TCTCCAGGGC TGGCTAGCCG GGCTCCTAGA GCGGAGACTG CCAAGGCCTT 660 

CGGGTCCTGG GCAGGAAGGA TCCTGGCAGG GAGGASTTGC TTGGGGGGTG GGGGGGAAAG 720 

GCTCCAGGCG CGGTGGAGCT CTGACCAGGA GAAT|GCACAC ACTCGGAGGG GAGGAGGCGT 780 

GTCAGCCCCA AGCTAGCATC CCACCCGGGG AGdAGCGATG TGGGGCGAAG GTAGCCAGAG 840 

CAAAAGAGCA GGCACCAGGT GACACGAAAC ACSAAGATTCC GGGTAGAGCC AGAACCCCAG 900 

AAGTCCCATT CAGGGAAGGT GCGAGGCGAG yAACGAGTTAG GTGGACCCTC TCCAGGGGCA 960 

GCCAAAGAAA TCTAAAGAGA ACCCGAAGGA CTTGCCGGAA AGAGAAACCG AAAGCGGCGG 1020 

TGGGCGGGAT CGGTGGGCGG GGCCTCCCTG GTTTAAGAGC TTGATGCAGG GGCGGGCAGC 1080 

AGCAGAGAGA ACTGCGGCCG TGGCAGCGGC ACGGCTCCCG GCCCCGGAGC ATGCGCGACA 1140 

GCAGCCCCGG AACCCCCAGC CGCGGOGCCC CGCGTCCCGC CGCCAGGTGA GCCGAGGCAG 1200 

CTGCGAAGGA GCAGGCGGGA GGGGATGGGA GGAAGGGGAG CAGAGCCTGG CAGGACTATC 1260 

CTCGCAGACT GCATGGCGGG GTCGTGGATG CTATGCCTCT GGCGCCCGCC CCACCGGCTG 1320 

GCCCAGGCGG CCCCTCGCGC GQGCGGGGCG CCGTCAGCCC CTCCTCTCCG GCCCTGAGCC 1380 

CGGATCGTCC GCCCGGGTTC OAGTTCCCGG CGTGGCCAGT AGGCGGCAAC CGCGAGGCGG 1440 

CAAGCCACCC AGCGGGGACG /gCCTGGAGTC GGGCCCCTCT CCACGCCCCC TTCTCCACGC 1500 
GCGCGGGGAG GCAGGGCTCS ACCGCCAGTC TGGAAGGGTT CCACATACAG GAACGGCCTA 1560 

CTTCGCAGAT GAGCCCACffiG AGGCTCAGGC TCCGGGCGGA TTCTGCGTGT CACCCTCGCT 1620 

CCTTGGGGTC CGCTGGCCGG CCTGTGCCAC CCGGACGCCC GGTTCACTGC CTCTGTCTCC 1680 

CCCATCAGCG CAGCCCCGGA CGCTATGGCC CACCCCTCCA GCTGGCCCCT CGAGTAGGAT 1740 

GGTAGCACGT AACCAGGTGG AAGCCGACAA TGCGATCTCC CCGGCATCAG AGCCCCGACG 1800 
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GCGGCCAGAG CCATCCTCGT CCTCGTCTTC GTCCTCGCCG GCGfiCCCCGG CGCGTCCCCG 
GCCCTGCCCG GTGGTCCCGG CCCCGGCTCC GGGCGACACT CACTTCCGCA CCTTCCGCTC 
CCACTCTGAT TACCGGCGCA TCACGCGGAC CAGCGCTCTC GTGGACGCCT GCGGCTTCTA 
CTGGGGACCC CTGAGCGTGC ATGGGGCGCA CGAACGGCTG/CGTTCCGAAC CCGTGGGCAC 
CTTCTTGGTG CGCGACAGTC GCCAGCGGAA CTGCTTCTTC GCGCTCAGCG TGAAGATGGC 
TTCGGGCCCC ACGAGCATTC GTGTGCACTT CCAGGCCGGC CGCTTCCACC TGGACGGCAA 



CCGCGAGACC TTCGACTGCC TCTTCGAGCT GCTGGAGCAC TACGTGGCGG CGCCGCGCCG 
CATGTTGGGG GCCCCACTGC GCCAGCGCCG CGTGGGGCCG CTGCAGGAGC TGTGTCGCCA 
GCGCATCGTG GCCGCCGTGG GTCGCGAGAA CCTGGCACGC ATCCCTCTTA ACCCGGTACT 
CCGTGACTAC CTGAGTTCCT TCCCCTTCCA GOTCTGACCG GCTGCCGCCG TGCCCGCAGA 
ATTAAGTGGG AGCGCCTTAT TATTTCTTAT 2DATTAATTAT TATTATTTTT CTGGAACCAC 
GTGGGAGCCC TCCCCGCCTA GGTCGGAGGG' AGTGGGTGTG GAGGGTGAGA TCCCTCCCAC 
TTCTGGCTGG AGACCTTATC CCGCCTCTGG GGGGGCCTCC CCTCCTGGTG CTCCCTCCCG 
GTCCCCCTGG TTGTAGCAGC TTGTGTCTGG GGCCAGGACC TGAACTCCAC GCCTACCTCT 
CCATGTTTAC ATGTTCCCAG TATCTTOGCA CAAACCAGGG GTGGGGGAGG GTCTCTGGCT 
TCATTTTTCT GCTGTGCAGA ATATTCTATT TTATATTTTT ACATCCAGTT TAGATAATAA 
ACTTTATTAT GAAAGTTTTT TTTT)TTAAAG AAACAAAGAT TTCTAGA 
(2) INFORMATION FOR SEQ 3D NO: 12 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: ami/no acid 

(C) STRANDED^ESS : single 

(D) TOPOLOGY*- linear 

(ii) MOLECULE TYRE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Val Ala Arg Asn Gin Val Glu Ala Asp Asn Ala lie Ser Pro Ala 
1/5 10 15 

Ser Glu Pro Airg Arg Arg Pro Glu Pro Ser Ser Ser Ser Ser Ser Ser 
2r0 25 30 



1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2807 
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Ser Pro Ala Ala Pro Ala Arg Pro Arg Pro Cys Pro Val Val 
35 40 / 45 



Pro Ala Pro Gly Asp Thr His Phe Arg Thr 
50 55 



?he Arg Ser His 
60 



Tyr Arg Arg lie Thr Arg Thr Ser Ala Leu Leu Asp Ala Cys 
65 70 / 75 

Tyr Trp Gly Pro Leu Ser Val His Gly/Ala His Glu Arg Leu 

85 / 90 



Pro Ala 
Ser Asp 



Gly Phe 
80 

Arg Ser 
95 



Glu Pro Val Gly Thr Phe Leu Val iyrg Asp Ser Arg Gin Arg 
100 £05 110 



Asn Cys 

Phe Phe Ala Leu Ser Val Lys Me^ Ala Ser Gly Pro Thr Ser lie Arg 

Glu Thr 



ij/ a: 



Val His Phe Gin Ala Gly Arg Phe His Leu Asp Gly Asn Arg 
130 135 / 140 

Phe Asp Cys Leu Phe Glu Leu Leu Glu His Tyr Val Ala Ala 
145 150 / 155 

Arg Met Leu Gly Ala Pro ten Arg Gin Arg Arg Val Arg Pro 

165 / 170 

Glu Leu Cys Arg Gin Arg lie Val Ala Ala Val Gly Arg Glu 
180 f 185 190 

Ala Arg lie Pro Leu Asn Pro Val Leu Arg Asp Tyr Leu Ser 
195 / 200 205 

Pro Phe Gin lie 
210 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH^ 1611 base pairs 

(B) TYPE: /nucleic acid 

(C) STRAI^DEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE /TYPE: DNA 

(ix) FEATURE/ 

(A) N^ME/KEY: CDS 

(B) LOCATION: 263.. 1529 




Pro Arg 
160 

Leu Gin 
175 

Asn Leu 
Ser Phe 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 



♦ 
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CGAATTCCGG GCGGGCTGTG TGAGTCTGTG AGTGGAAGGC /GCGCCGGCTC TTTTGTCTGA 60 

GTGTGACCCG GTGGCTTTGT TCCAGGCATT CCGGTGATTj/ CCTCCGGGCA GTCCGCAGAA 120 

GCCGCAGCGG CCGCCCGCGC TCTCTCTGCA GTCTCCAOAC CCGGGAGAGC CTGAGCCCGC 180 

GTCACGCCCC TCAGCCCCCG CTGAGTCCCT TCTCTGTTGT CGCGTCCGAA TCGAGTTCCC 240 

GGAATCAGAC GGTGCCCCAT AG ATG GCC AGC TJTT CCC CCG AGG GTT AAC GAG 292 

Met Ala Ser Phe Pro Pro Arg Val Asn Glu 
l/5 10 



AAA GAG ATC GTG AGA TCA CGT ACT ATA GGG GAA CTC TTG GCT CCA GCA 
Lys Glu lie Val Arg Ser Arg Thr lie Gly Glu Leu Leu Ala Pro Ala 

15 / 20 25 

GCT CCT TTT GAC AAG AAA TGT GGT/GGT GAG AAC TGG ACG GTT GCT TTT 
Ala Pro Phe Asp Lys Lys Cys Gly Gly Glu Asn Trp Thr Val Ala Phe 
30 / 35 40 

GCT CCT GAT GGT TCC TAC TTT (fcG TGG TCA CAA GGA TAT CGC ATA GTG 
Ala Pro Asp Gly Ser Tyr Phe yAla Trp Ser Gin Gly Tyr Arg lie Val 
45 / 50 55 

AAG CTT GTC CCG TGG TCC CAG TGC CGT AAG AAC TTT CTT TTG CAT GGT 
Lys Leu Val Pro Trp Ser Gin Cys Arg Lys Asn Phe Leu Leu His Gly 
60 /65 70 



TCC AAA AAT GTT ACC AAT /TCA AGC TGT CTA AAA TTG GCA AGA CAA AAC 
Ser Lys Asn Val Thr Asir Ser Ser Cys Leu Lys Leu Ala Arg Gin Asn 
75 8p 85 90 

AGT AAT GGT GGT CAG AAA AAC AAG CCT CCT GAG CAC GTT ATA GAC TGT 
Ser Asn Gly Gly Gin Lys Asn Lys Pro Pro Glu His Val lie Asp Cys 

95 / 100 105 



GGA GAC ATA GTC T 
Gly Asp He Val T 
110 




AGT CTT GCT TTT GGG TCT TCA GTT CCA GAA AAA 
Ser Leu Ala Phe Gly Ser Ser Val Pro Glu Lys 
115 120 



CAG AGT CGT TGC ©TT AAT ATA GAA TGG CAT CGG TTC CGA TTT GGA CAG 
Val Asn He Glu Trp His Arg Phe Arg Phe Gly Gin 
130 135 



Gin Ser Arg Cys 



125 



GAT CAG CTA CTg 
Asp Gin Leu L€ 
140 



CTT GCC ACA GGA TTA AAC AAT GGT CGC ATC AAA ATC 
Leu Ala Thr Gly Leu Asn Asn Gly Arg He Lys He 
145 150 



340 



388 



436 



484 



532 



580 



628 



676 



724 




# 
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TGG GAT GTA TAT ACA GGA AAA CTC CTC CTT AAT TTG/GTA GAC CAC ATT 
Trp Asp Val Tyr Thr Gly Lys Leu Leu Leu Asn Leu Val Asp His lie 
155 160 165 / 170 



772 



GAA ATG GTT AGA GAT TTA ACT TTT GCT CCA GAT/GGG AGC TTA CTC CTT 
Glu Met Val Arg Asp Leu Thr Phe Ala Pro Asp Gly Ser Leu Leu Leu 
175 180 / 185 



/ 



GTA TCA GCT TCA AGA GAC AAA ACT CTA AGA X3TG TGG GAC CTG AAA GAT 
Val Ser Ala Ser Arg Asp Lys Thr Leu Arg/ Val Trp Asp Leu Lys Asp 
190 195 / 200 



820 



868 



GAT GGA AAC ATG GTG AAA GTA TTG CGG GCA CAT CAG AAT TGG GTG TAC 
Asp Gly Asn Met Val Lys Val Leu Arg Ala His Gin Asn Trp Val Tyr 
205 210 / 215 



/ 



AGT TGT GCA TTC TCT CCC GAC TGT TOT ATG CTG TGT TCA GTG GGC GCC 
Ser Cys Ala Phe Ser Pro Asp Cys Ser Met Leu Cys Ser Val Gly Ala 
220 225 / 230 



916 



964 



AGT AAA GCA GTT TTC CTT TGG AAT/ ATG GAT AAA TAC ACC ATG ATT AGG 
Ser Lys Ala Val Phe Leu Trp Asn Met Asp Lys Tyr Thr Met lie Arg 
235 240 / 245 250 



AAG CTG GAA GGT CAT CAC CAT GAT GTT GTA GCT TGT GAC TTT TCT CCT 
Lys Leu Glu Gly His His His /Asp Val Val Ala Cys Asp Phe Ser Pro 
255 / 260 265 



1012 



1060 



GAT GGA GCA TTG CTA GCT ACT GCA TCC TAT GAC ACT CGT GTG TAT GTC 
Asp Gly Ala Leu Leu Ala TMr Ala Ser Tyr Asp Thr Arg Val Tyr Val 
270 / 275 280 



1108 



TGG GAT CCA CAC AAT GGA pAC CTT CTG ATG GAG TTT GGG CAC CTG TTT 
Trp Asp Pro His Asn Gly/Asp Leu Leu Met Glu Phe Gly His Leu Phe 
285 / 290 295 



1156 



CCC TCG CCC ACT CCA AyA TTT GCT GGA GGA GCA AAT GAC CGA TGG GTG 
Pro Ser Pro Thr Pro lie Phe Ala Gly Gly Ala Asn Asp Arg Trp Val 
300 / 305 310 



1204 



AGA GCT GTG TCT TTC /AGT CAT GAT GGA CTG CAT GTT GCC AGC CTT GCT 1252 
Arg Ala Val Ser Phe/ Ser His Asp Gly Leu His Val Ala Ser Leu Ala 
315 / 320 325 330 

GAT GAT AAA ATG GTjG AGG TTC TGG AGA ATC GAT GAG GAT TGT CCG GTA 1300 
Asp Asp Lys Met Val Arg Phe Trp Arg lie Asp Glu Asp Cys Pro Val 

\5 340 345 



CAA GTT GCA CCT j/TTG AGC AAT GGT CTT TGC TGT GCC TTT TCT ACT GAT 
Gin Val Ala Pro /Leu Ser Asn Gly Leu Cys Cys Ala Phe Ser Thr Asp 
350/ 355 360 



1348 
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GGC AGT GTT TTA GCT GCT GGG AC A CAT GAT GGA AG# GTG TAT TTT TGG 1396 
Gly Ser Val Leu Ala Ala Gly Thr His Asp Gly s/r Val Tyr Phe Trp 
365 370 / 375 

GCC ACT CCA AGG CAA GTC CCT AGC CTT CAA CKf ATA TGT CGC ATG TCA 1444 
Ala Thr Pro Arg Gin Val Pro Ser Leu Gin Ki/s lie Cys Arg Met Ser 
380 385 / 390 

ATC CGA AGA GTG ATG TCC ACC CAA GAA GTC /cAA AAA CTG CCT GTT CCT 1492 
lie Arg Arg Val Met Ser Thr Gin Glu Vay Gin Lys Leu Pro Val Pro 
395 400 / 405 410 

TCC AAA ATA TTG GCG TTT CTC TCC TAC QGC GGT TAG A CTGAAGACTG 1539 
Ser Lys lie Leu Ala Phe Leu Ser Tyr Arg Gly 
415 /420 

CCTTTCCTGG TAGGCCTGCC AGACAGAGCG G'CCTTTACAA GACACACCTC AAGCTTTACC 1599 

TCGTGCCGAA TT / 1611 

(2) INFORMATION FOR SEQ ID NO: 

(i) 



(ii) 



SEQUENCE CHA 

(A) LENGTH 

(B) TYP] 
(D) TOPOLOGY 

MOLECULE TYPE: 




/STICS: 
421 amino acids 
amino /acid 
mear 



(xi) SEQUENCE DESCR] 
Met Ala Ser Phe Pro Pro 



rotein 

PTION: SEQ ID NO: 14: 

:g Val Asn Glu Lys Glu lie Val Arg Ser 
10 15 



Arg Thr lie Gly Glu Leuy 
20 

Cys Gly Gly Glu Asn Ti 
35 



Leu Ala Pro Ala Ala Pro Phe Asp Lys Lys 
25 30 

Thr Val Ala Phe Ala Pro Asp Gly Ser Tyr 
40 45 



Phe Ala Trp Ser Gin 
50 

Gin Cys Arg Lys Asi 
65 



uy Tyr Arg lie Val Lys Leu Val Pro Trp Ser 
55 60 

Phe Leu Leu His Gly Ser Lys Asn Val Thr Asn 
70 75 80 



Ser Ser Cys Leu Lys Leu Ala Arg Gin Asn Ser Asn Gly Gly Gin Lys 

!5 90 95 



Asn Lys Pro Pro Glu His Val lie Asp Cys Gly Asp lie Val Trp Ser 
100 / 105 110 
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Leu Ala Phe Gly Ser Ser Val Pro Glu Lys Gin Se/ Arg Cys Val Asn 
115 120 / 125 

lie Glu Trp His Arg Phe Arg Phe Gly Gin Asp JGln Leu Leu Leu Ala 
130 135 /l40 

Thr Gly Leu Asn Asn Gly Arg lie Lys lie Trp Asp Val Tyr Thr Gly 
145 150 135 160 

Lys Leu Leu Leu Asn Leu Val Asp His lie yGlu Met Val Arg Asp Leu 

165 170/ 175 

Thr Phe Ala Pro Asp Gly Ser Leu Leu LeA Val Ser Ala Ser Arg Asp 
180 185 / 190 

Lys Thr Leu Arg Val Trp Asp Leu Lys Asp Asp Gly Asn Met Val Lys 
195 200 / 205 

Val Leu Arg Ala His Gin Asn Trp Vafl. Tyr Ser Cys Ala Phe Ser Pro 
210 215 / 220 

Asp Cys Ser Met Leu Cys Ser Val ply Ala Ser Lys Ala Val Phe Leu 
225 230 / 235 240 

Trp Asn Met Asp Lys Tyr Thr Met lie Arg Lys Leu Glu Gly His His 

245 / 250 255 

His Asp Val Val Ala Cys Asp Bhe Ser Pro Asp Gly Ala Leu Leu Ala 
260 / 265 270 

Thr Ala Ser Tyr Asp Thr Arg/ Val Tyr Val Trp Asp Pro His Asn Gly 
275 7 280 285 

Asp Leu Leu Met Glu Phe Gl'y His Leu Phe Pro Ser Pro Thr Pro lie 
290 2p5 300 

Phe Ala Gly Gly Ala Asn Asp Arg Trp Val Arg Ala Val Ser Phe Ser 
305 310/ 315 320 

His Asp Gly Leu His Val Ala Ser Leu Ala Asp Asp Lys Met Val Arg 

325 / 330 335 

Phe Trp Arg lie Asp Gfl.u Asp Cys Pro Val Gin Val Ala Pro Leu Ser 
340 / 345 350 

Asn Gly Leu Cys Cys /Ala Phe Ser Thr Asp Gly Ser Val Leu Ala Ala 
355 / 360 365 



Gly Thr His Asp Gly Ser Val Tyr Phe Trp Ala Thr Pro Arg Gin Val 
370 / 375 380 
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Pro Ser Leu Gin His lie Cys Arg Met Ser lie 
385 390 395 

Thr Gin Glu Val Gin Lys Leu Pro Val Pro Ser 
405 410 

Leu Ser Tyr Arg Gly 
420 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 783 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ tD NO: 15 
CTGTCTTCCT CCGCAGCGCG AGGCTGGGTA GfAGGGTCTAT 
CTTTGGTCTG AGGCCTTCGG GAGCTTTCCC /gAGGC AGTTA 
CCCCGCCCGT CTCCTCTGTC CCTGGGCCCffi GGAGACAAAC 
GTCGCCACTC TCTTCTCTGT TGTTGGGTCC GCATCGTATT 
CATAGATGGC CAGCTTTCCC CCGAGGGTCA ACGAGAAAGA 
TAGGTGAACT TTTAGCTCCT GCAGCTCCTT TTGACAAGAA 
CTGTTGCTTT TGCTCCAGAT GGTTCZATACT TTGCTTGGTC 
AGCTTGTTCC GTGGTCCCAG TGCGTTCAGA ACTTTCTCTT 
CCAATTCAAG CAGTTTAAGA TTSCCAAGAC AAAATAGTGA 
CTCGTGACAT ATTATAGACT G2TGGAGATAT AGTCTGGAGT 
TCCAGAAAAA CAGAGTCGCT GTGTAAATAT AGAATGGCAT 
TCAGCTACTT CTTGCTACAg/gGTTGAACAA TGGGCGTATC 
GGAAACTCCT CCTTAACTTjG GTAGATCATA CTGAAGTGGT 
CAG 

(2) INFORMATION FOyfl SEQ ID NO: 16: 
( i ) SEQUENCE/ CHARACTERISTICS : 



Arg /Arg Val Met Ser 
400 

L^s lie Leu Ala Phe 
415 



TGTCTGTGGT 


TGACTCCGTA 


60 


GCAGAAGCCG 


CAGCGACCGC 


120 


TTGGCGTCAC 


GCCCTCAGCG 


180 


CCCGGAATCA 


GACGGTGCCC 


240 


GATCGTGAGA 


TCACGTACTA 


300 


ATGTGGTCGT 


GAAAATTGGA 


360 


ACAAGGACAT 


CGCACAGTAA 


420 


GCATGGCACC 


AAGAATGTTA 


480 


TGGTGGTCAG 


AAAAATAAGC 


540 


CTTGCTTTTG 


GGTCATCAGT 


600 


CGCTTCAGAT 


TTGGACAAGA 


660 


AAAATATGGG 


ATGTATATCA 


720 


CAGAGATTTA 


ACTTTTGCTC 


780 






783 
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(A) LENGTH: 1122 base pairs / 

(B) TYPE: nucleic acid / 

(C) STRANDEDNESS : single / 

(D) TOPOLOGY: linear / 

(ii) MOLECULE TYPE: DNA / 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l/: 

CTCTGTATGT CTGAATGAAG CTATAACATT TGCCTTTTTA TTGCAGGTTT TCCTTTGGAA 60 

TATGGATAAA TACACCATGA TACGGAAACT AGAAGGACAT CACCATGATG TGGTAGCTTG 120 

TGACTTTTCT CCTGATGGAG CATTACTGGC TACTG^ATCT TATGATACTC GAGTATATAT 180 

CTGGGATCCA CATAATGGAG ACATTCTGAT GGAiwTTGGG CACCTGTTTC CCCCACCTAC 240 

TCCAATATTT GCTGGAGGAG CAAATGACCG GTGGGTACGA TCTGTATCTT TTAGCCATGA 300 

TGGACTGCAT GTTGCAAGCC TTGCTGATGA tAaATGGTG AGGTTCTGGA GAATTGATGA 360 

GGATTATCCA GTGCAAGTTG CACCTTTGAG CAATGGTCTT TGCTGTGCCT TCTCTACTGA 420 

TGGCAGTGTT TTAGCTGCTG GGACACATGA' CGGAAGTGTG TATTTTTGGG CCACTCCACG 480 

GCAGGTCCCT AGCCTGCAAC ATTTATGTaG CATGTCAATC CGAAGAGTGA TGCCCACCCA 540 

AGAAGTTCAG GAGCTGCCGA TTCCTTCCAA GCTTTTGGAG TTTCTCTCGT ATCGTATTTA 600 

GAAGATTCTG CCTTCCCTAG TAGTAGGGAC TGACAGAATA CACTTAACAC AAACCTCAAG 660 

CTTTACTGAC TTCAATTATC TGTTTTTAAA GACGTAGAAG ATTTATTTAA TTTGATATGT 720 

TCTTGTACTG CATTTTGATC AGTTGAGCTT TTAAAATATT ATTTATAGAC AATAGAAGTA 780 

TTTCTGAACA TATCAAATAT AAA^TTTTTTT AAAGATCTAA CTGTGAAAAC ATACATACCT 840 

GTACATATTT AGATATAAGC TGCTATATGT TGAATGGACC CTTTTGCTTT TCTGATTTTT 900 

AGTTCTGACA TGTATATATT GCTTCAGTAG AGCCACAATA TGTATCTTTG CTGTAAAGTG 960 

CAAGGAAATT TTAAATTCTG/gGACACTGAG TTAGATGGTA AATACTGACT TACGAAAGTT 1020 

GAATTGGGTG AGGCGGGCAA ATCACCTGAG GTCAGCAGTT TGAGACTAGC CTGGCAAACA 1080 

TGATGAAACC CTGTCTCTAC TAAAAATACA AAAAAAAAAA AA 1122 
(2) INFORMATION FOR SEQ ID NO: 17: 

{ i ) SEQUENCE /CHARACTERISTICS : 

(A) LENGTH: 2544 base pairs 

(B) TYBE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 423.. 2 030 

(xi) SEQUENCE DESCRIPTION: SEQ ID n6:17; 



CGGCACGAGC 


CGGGCTCCGT 


CCGGAGGAAG 


CGAGGCTGCG 


CCGCCGGCCC 


GGCAGGAGCG 


60 


GAGGACGGGA 


MGCGCGGGCG 


GTCGCGCTCG 


CCCTGTCGCT 


GACTGCGCTG 


CCCCGGCCCA 


120 


TCCTTGCCTG 


GCCGCAGGTG 


CCCTGGATGA 


GGCCGCCGCG 


CGTGTCCCGG 


CCGCTGAGTG 


180 


TCCCCCGCGG 


TCGCCCGGCG 


CCTGCCCTCA 


MJCGGCCGCC 


TCTCCTTGCC 


CGGGTCCCCG 


240 


TTTTCCCCCG 


GCGCAGTCCT 


CCTCCGGTGG, 


/gcgcctccgc 


ACCTCGGCGC 


AGGCGGCACG 


300 


GCCCTCGGGC 


CGGGATGGAT 


CCGCCGGGAA 


GAGGAAGACA 


AGCCGGGGCG 


TTGAGCCCCT 


360 


GCGCACGGTG 


CCGCCGCGCG 


TAGTGGG^SC 


TTACTCGCAG 


TAGGCTCTCG 


CTCTTCTAAT 


420 


CA ATG GAT 
Met Asp 


AAA GTG GGG AAA ATG TGG AAC AAC TTA AAA TAC 
Lys Val Gly Lys Met Trp Asn Asn Leu Lys Tyr 


AGA TGC 
Arg Cys 


467 



10 



15 



CAG AAT CTC TTC AGC CAC GAG/ GGA GGA AGC CGT AAT GAG AAC GTG GAG 
Gin Asn Leu Phe Ser His Glu Gly Gly Ser Arg Asn Glu Asn Val Glu 

20 / 25 30 



515 



ATG AAC CCC AAC AGA TGT 
Met Asn Pro Asn Arg Cys 
35 



:G TCT GTC AAA GAG AAA AGC ATC AGT CTG 
Ser Val Lys Glu Lys Ser lie Ser Leu 
40 45 



563 



GGA GAG GCA GCT CCC CAG/ CAA GAG AGC AGT CCC TTA AGA GAA AAT GTT 
Gly Glu Ala Ala Pro Gin Gin Glu Ser Ser Pro Leu Arg Glu Asn Val 
50 / 55 60 



611 



GCC TTA CAG CTG GGA CTG AGC CCT TCC AAG ACC TTT TCC AGG CGG AAC 
Ala Leu Gin Leu Gly lien Ser Pro Ser Lys Thr Phe Ser Arg Arg Asn 
65 / 70 75 

CAA AAC TGT GCC GCA/GAG ATC CCT CAA GTG GTT GAA ATC AGC ATC GAG 
Gin Asn Cys Ala Ala/ Glu He Pro Gin Val Val Glu He Ser He Glu 
80 / 85 90 95 

AAA GAC AGT GAC TGfG GGT GCC ACC CCA GGA ACG AGG CTT GCA CGG AGA 
Lys Asp Ser Asp Ser Gly Ala Thr Pro Gly Thr Arg Leu Ala Arg Arg 

105 110 



659 



707 



755 



01 



GAC TCC TAC TCG CGG CAC GCC CCG TGG GGA 
Asp Ser Tyr Ser Arg His Ala Pro Trp Gly 
115 120 

TGT TCC ACA AAG ACC CAG AGT TCA TTG GAT 
Cys Ser Thr Lys Thr Gin Ser Ser Leu Asp 
130 135 

AGA ACT CGA AGC GGC CTT CAG AGG CGA GAG 
Arg Thr Arg Ser Gly Leu Gin Arg Arg Glu 
145 150 

TCC ATG CAG GAC ATG GAC AGC GTT TCT AGC 
Ser Met Gin Asp Met Asp Ser Val Ser Ser 
160 165 

TCC CTG AGG CAG AGG CTC CAG GAC ACG GTG 
Ser Leu Arg Gin Arg Leu Gin Asp Thr 

180 18! 

AGA ACT TAC AGC AAG CAG TCA AAG CCA 
Arg Thr Tyr Ser Lys Gin Ser Lys Pro Ls 
195 200 

ATA CAT CTT TCT GAA TTA ATG CTG GAG 
lie His Leu Ser Glu Leu Met Leu Glu 
210 215 

TCG GAT TTA GCA CAA AAG TGG CAT TTG/ ATT 
Ser Asp Leu Ala Gin Lys Trp His hef lie 
225 230 

GTG AGC CCA CAC TCA ACA TTT TTT GAT ACA 
Val Ser Pro His Ser Thr Phe Phe A£p Thr 
240 245 

TCT ACA GAA GAT GAA GAA GAT AGG fcTT CGC 
Ser Thr Glu Asp Glu Glu Asp Arg /Leu Arg 

260 / 265 

ATC GAA GAA GGG GTG GAT CCC CCf CCC AAC 
lie Glu Glu Gly Val Asp Pro Pr6 Pro Asn 
275 / 280 



GGA AAG AftG AAA CAT TCC 803 

Gly Lys Lys Lys His Ser 
125 

ACC GAG 7 AAA AAG TTT GGT 851 

Thr Glju Lys Lys Phe Gly 
140 

CGG pGC TAT GGA GTC AGC 899 

bxqf&rq Tyr Gly Val Ser 
155 

CGC GCG GTC GGG AGC CGC 947 
•g Ala Val Gly Ser Arg 

0 175 

, T TTG TGT TTT ccc ATG 995 
Gly Leu Cys Phe Pro Met 

190 

TTT TCC AAT AAA AGA AAA 1043 
Phe Ser Asn Lys Arg Lys 
205 

TGC CCT TTT CCT GCT GGC 1091 
Cys Pro Phe Pro Ala Gly 
220 

CAT ACC GCC CCT 1139 
His Thr Ala Pro 



AAA 
Lys 



TTT 
Phe 
250 

GAG 
Glu 



CAG 
Gin 
235 

GAT 
Asp 



AGA 
Arg 



GAA GCT ACT GCA CAG GTC AAC 
Glu Ala Thr Ala Gin Val Asn 
290 



:A TTG TAT 
:o Leu Tyr 
f95 



GCT CCT GGG ATG ACA GAG ATA kGT GGA GAT 
Ala Pro Gly Met Thr Glu lie /Ser Gly Asp 
305 310, 



GCA CAA 
Ala Gin 



AAG CTG 
Lys Leu 



GGT TCT 
Gly Ser 
315 



CCA TCA CTG GTG 1187 
Pro Ser Leu Val 
255 

AGA CGG CTT AGT 1235 
Arg Arg Leu Ser 
270 

ATA CAC ACC TTT 1283 
lie His Thr Phe 
285 

GGA CCA AAG TTA 1331 

Gly Pro Lys Leu 

300 

GCA ATT CCA CAA 1379 
Ala lie Pro Gin 



/ 



GCS AAT TGT GAC TCA GAA GAG GAT TCA ACC ACC CTA/TGT CTG CAG TCA 
Xaa Asn Cys Asp Ser Glu Glu Asp Ser Thr Thr Leu Cys Leu Gin Ser 
320 325 330 / 335 



1427 



CGG AGG CAG AAG CAG CGC CAG GTG TCC GGG GAC 
Arg Arg Gin Lys Gin Arg Gin Val Ser Gly Asp 
340 345 



3C CAC GCG CAC GTT 
His Ala His Val 
350 



1475 



AGC AGA CAG GGA GCT TGG AAA GTT CAT ACG CAg 
Ser Arg Gin Gly Ala Trp Lys Val His Thr G] 
355 360 



ATC GAT TAC ATA CAC 
lie Asp Tyr lie His 
365 



1523 



TGC CTC GTG CCA GAT TTG CTT CAG ATC ACA /GGG AAT CCC TGT TAC TGG 
Cys Leu Val Pro Asp Leu Leu Gin lie Thr/ Gly Asn Pro Cys Tyr Trp 
370 375 / 380 



1571 



GGC GTG ATG GAC CGA TAC GAG GCC GAA 
Gly Val Met Asp Arg Tyr Glu Ala Glu 
385 390 



:C CTT CTA GAA GGG AAA CCG 
ila Leu Leu Glu Gly Lys Pro 
395 



1619 



GAA GGC ACG TTC TTG CTC AGG GAC 
Glu Gly Thr Phe Leu Leu Arg Asp S€ 
400 405 



GCA CAG GAG GAC TAC CTC TTC 
Ala Gin Glu Asp Tyr Leu Phe 
410 415 



1667 



TCT GTG AGC TTC CGC CGC TAC AAC AGG TCT CTG CAC GCC CGG ATC GAG 
Ser Val Ser Phe Arg Arg Tyr Asn/Arg Ser Leu His Ala Arg lie Glu 
420 / 425 430 



1715 



CAG TGG AAC CAC AAC TTC AGC TTG GAT GCC CAT GAC CCC TGC GTG TTT 
Gin Trp Asn His Asn Phe Ser Pme Asp Ala His Asp Pro Cys Val Phe 
435 / 440 445 



1763 



CAC TCC TCC ACW GTC ACG GGG £TT CTC GAA CAC TAT AAA GAC CCC AGC 
His Ser Ser Xaa Val Thr Gly/Leu Leu Glu His Tyr Lys Asp Pro Ser 
450 / 455 460 



1811 



TCT TGC ATG TTT TTT GAA COG TTG CTA ACG ATA TCA CTG AAT AGA ACT 

Ser Cys Met Phe Phe Glu Pro Leu Leu Thr lie Ser Leu Asn Arg Thr 

465 4/70 475 

TTC CCT TTC AGC CTG CAG JTAT ATC TGC CGC GCA GTG ATC TGC AGA TGC 

Phe Pro Phe Ser Leu Gln/Tyr lie Cys Arg Ala Val lie Cys Arg Cys 

480 485/ 490 495 



ACT ACG TAT GAT GGG 
Thr Thr Tyr Asp Gly I] 
500 

GAT TTT TTA AAA GAG 
Asp Phe Leu Lys Glu 
515 



GAC GGG CTC CCG CTA CCG TCG ATG TTA CAG 

Asp Gly Leu Pro Leu Pro Ser Met Leu Gin 
505 510 

fAT CAT TAT AAA CAA AAA GTT AGG GTT CGC TGG 

rr His Tyr Lys Gin Lys Val Arg Val Arg Trp 

520 525 



1859 



1907 



1955 



2003 
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TTA GAA CGA GAR CCA GTC AAA GCA AAG TAACTCCTGT/ CCCCAAAGGG 
Leu Glu Arg Xaa Pro Val Lys Ala Lys 
530 535 

CACTAACTAA GTCTGCTCCT CCCGTGCATC MGAACTGCAC/ CCATAGGRAG GCAGTCAGCT 

GCTAGGATTT CCCACCCAGA ATGGGAGCTT AGTCATTAQC CTCTGCCCTA TGGGGTCCGC 

TGTTCCTCAG ACAAAGGTGC CTAGGGACAG CAAGATGQCT TGCAGGTGTT CGGTGGGCTG 

TGACAACTGA GGGAGGCAAC TCTGGGGCAT TTGCTAtfGAA GAATTCTATT TCTTACCGAA 

GAACAAATTA TTAATATTGG ATGGGTATTT CAATAGTGTG ACTAATGTTT GAAATTATTT 

TTTCTAAGAA TTTTTCTATA ACCTTCAGAA AAAG2TAGTGA TGTTTGTAGT TACTATAAAT 

CAAGCTTTGA AAGTTCAAAA CAAACAAGTT AA^TAAAAGA CTACCTTCCT TTTAGAGAAA 

ACAAATGCAA GTTTTCCCAG CCACAGGCAT TQ^GCACTGT TAATGTTAGC TTGTTATCAG 

CTCCTTTCTC CTCC 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 amino acids 

(B) TYPE: amino acrid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Asp Lys Val Gly Lys Met/Trp Asn Asn Leu Lys Tyr Arg Cys Gin 
1 5/10 15 

Asn Leu Phe Ser His Glu G1& Gly Ser Arg Asn Glu Asn Val Glu Met 
20 / 25 30 

Asn Pro Asn Arg Cys Pro sfer Val Lys Glu Lys Ser lie Ser Leu Gly 
35 / 40 45 

Glu Ala Ala Pro Gin Gln/Glu Ser Ser Pro Leu Arg Glu Asn Val Ala 
50 / 55 60 

Leu Gin Leu Gly Leu Sef Pro Ser Lys Thr Phe Ser Arg Arg Asn Gin 
65 7/0 75 80 

Asn Cys Ala Ala Glu ite Pro Gin Val Val Glu He Ser He Glu Lys 

85 7 90 95 



2050 

2110 
2170 
2230 
2290 
2350 
2410 
2470 
2530 
2544 



Asp Ser Asp Ser Gly Ala Thr 
100 
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Pro Gly Thr Arg Leu A2!a Arg Arg Asp 
105 / 110 



Ser Tyr Ser Arg His Ala Pro 
115 



Trp Gly Gly Lys Lys/Lys His Ser Cys 
120 / 125 



Ser Thr Lys Thr Gin Ser Ser 
130 135 



Leu Asp Thr Glu Lys Lys Phe Gly Arg 

10 



Thr Arg Ser Gly Leu Gin Arg 
145 150 



Arg Glu Arg Arg /Tyr Gly Val Ser Ser 
155/ 160 



Met Gin Asp Met Asp Ser Val 

165 



Ser Ser Arg ALa Val Gly Ser Arg Ser 
170 / 175 



Leu Arg Gin Arg Leu Gin Asp 
180 



Thr Val Gly Leu Cys Phe Pro Met Arg 
185 / 190 



Thr Tyr Ser Lys Gin Ser Lys 
195 



Pro Leu Phe Ser Asn Lys Arg Lys lie 
200 / 205 



His Leu Ser Glu Leu Met Leu 
210 215 



Glu Lys Qys Pro Phe Pro Ala Gly Ser 
220 



Asp Leu Ala Gin Lys Trp His 
225 230 



Leu Ile/Lys Gin His Thr Ala Pro Val 
235 240 



Ser Pro His Ser Thr Phe Phe 
245 



Asp Th£ Phe Asp Pro Ser Leu Val Ser 
250 255 



Thr Glu Asp Glu Glu Asp Arg 
260 



Leu qxg Glu Arg Arg Arg Leu Ser lie 
>65 270 



Glu Glu Gly Val Asp Pro Pro Pro; Asn Ala Gin lie His Thr Phe Glu 
275 28(5 285 

Ala Thr Ala Gin Val Asn Pro Leu Tyr Lys Leu Gly Pro Lys Leu Ala 
290 295 / 300 

Pro Gly Met Thr Glu lie Ser 6ly Asp Gly Ser Ala lie Pro Gin Xaa 
305 310 / 315 320 

Asn Cys Asp Ser Glu Glu hsd Ser Thr Thr Leu Cys Leu Gin Ser Arg 

325 / 330 335 

Arg Gin Lys Gin Arg Gin vll Ser Gly Asp Ser His Ala His Val Ser 
340 / 345 350 



Arg Gin Gly Ala Trp Lys /fal His Thr Gin lie Asp Tyr He His Cys 
355 / 360 365 
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Leu Val Pro Asp Leu Leu Gin lie Thr Gly Asn Pro ^ys Tyr Trp Gly 
370 375 380 y 

Val Met Asp Arg Tyr Glu Ala Glu Ala Leu Leu Gin Gly Lys Pro Glu 
385 390 395 / 400 

Gly Thr Phe Leu Leu Arg Asp Ser Ala Gin Glu Asp Tyr Leu Phe Ser 
405 410 / 415 

Val Ser Phe Arg Arg Tyr Asn Arg Ser Leu His Ala Arg lie Glu Gin 
420 425 / 430 

Trp Asn His Asn Phe Ser Phe Asp Ala His Asp Pro Cys Val Phe His 
435 440 / 445 

Ser Ser Xaa Val Thr Gly Leu Leu Glu Hife Tyr Lys Asp Pro Ser Ser 
450 455 / 460 

Cys Met Phe Phe Glu Pro Leu Leu Thr/lle Ser Leu Asn Arg Thr Phe 
465 470 / 475 480 

Pro Phe Ser Leu Gin Tyr lie Cys hdg Ala Val lie Cys Arg Cys Thr 
485 / 490 495 

Thr Tyr Asp Gly lie Asp Gly Leu /Pro Leu Pro Ser Met Leu Gin Asp 
500 /505 510 

Phe Leu Lys Glu Tyr His Tyr Lyjk Gin Lys Val Arg Val Arg Trp Leu 
515 520 525 

Glu Arg Xaa Pro Val Lys Ala /ys 
530 535 



(2) INFORMATION FOR SEQ ID/ NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1221 base pairs 

(B) TYPE: nuclei c acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY/ linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GATTAAACAG CATACAGCTC CTGTGAGCCC ACATTCAACA TTTTTTGATA CTTTGATCCA 
TCTTTGGTTT CTACAGAAGA TGAAGAAGAT AGGCTTAGAG AGAGAAGGCG GCTTAGTATT 
GAAGAAGGGG TTGATCGCCC TCCCAATGCA CAAATACATA CATTTGAAGC TACTGCACAG 



60 
120 
180 
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GTTAATCCAT TATTAAACTG GGACCAAAAT TAGCTCCTGG 
ACAGTTCTGC AATTCCACAA GCTAATTGTG ACTCGGAAGA 
GCAGTCACGG AGGCAGAAGC AGCGTCAGAT ATCTGGAGAC 
ACAGGGAGCT TGGAAAGTCC ACACACAGAT TGATTACATA 
GCTTCAAATT ACAGGGAATC CCTGTTACTG GGGAGTGATC 
CCTTCTCGAA GGGAAACCTG AAGGCACGTT TTTGCTCAGG 
CTTCTTCTCT GTGAGCTTCC GCCGATACAA CAGATCOCTG 
GAATCACAAC TTTAGTTTCG ACGCCCATGA CCCGTGTGTA 
GGGACTTTTA GAACATTATA AAGATCCCAG TTCGTGCATG 
TATATCACTA AATAGGACTT TCCCTTTTAG COTGCAGTAT 
CAGGTGCACT ACGTATGATG GAATTGATGG yGCTCCCTCTA 
TTTAAAAGAG TATCATTATA AACAAAAAGT TAGAGTTCGC 
CAAGGCAAAG TAAACTCTCC GGTCCCCAAA GGGTGTTAAC 
TCAGACAGTA CACCTATAGC AAGCACATCT AGCAGTGTTA 
AAGCTTAGTG TTAGTATCTG TCAGATGCTA CCTGCTGTTA 
GCCTATTGGA ACAATAGCGG ATAGAGCTAC AGGTGTTCAG 
GCCTATTTCG CTAACAGTTT GGTj/TTTTAAT GGCTGTGGTA 
GGCATTTGTT ATGAAGAAAT G 
(2) INFORMATION FOR SEQf ID NO: 20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:/ 2369 base pairs 

(B) TYPE: rfucleic acid 

(C) STRANBEDNESS : single 

(D) TOPOLOGY ; linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAMfi/KEY: CDS 

(B) LOCATION: 116.. 1330 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



AATGACTGAA 
GGATACAACC 
AGOCATACCC 
lCTGCTTCG 
GACCGTTATG 
GACTCTGCGC 
CATGCCCGAA 
TTTCACTCCT 
TTTTTTGAAC 
ATCTGTCGCG 
CCCTCAATGT 
TGGTTGGAAC 
TAGGTCCGCT 
GGCTTTTTCA 
CTTATTCAGA 
TAAGACTACA 
TTTGAGTGAG 



ATAAGTGGGG 
ACCCTGTGTT 
ATGTTAGCAG 
TGCCTGATTT 
AAGCAGAAGC 
AAGAGGACTA 
TTGAGCAGTG 
CCACTGTAAC 
CATTGCTTAC 
CGGTAATCTG 
TACAGGATTT 
GAGAACCAGT 
TTCATGTGCA 
TACAGTATGT 
TAAACATGGT 
AAAACATTTT 
GCAACTCTGG 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1221 
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GGCACGAGGC GGTGGTGGCG GCGGCGGGCG CGGCCGCGGC GGGGO 



GCCCACGGCC CTGGGGGCTG AGGCGCCCGC CGCCTGGGGC GGGCCGgGCG TCCTC ATG 

Met 
1 

GAG GCC GGA GAG GAG CCG CTG CTG CTG GCT GAA CTC AAG CCT GGG CGC 
Glu Ala Gly Glu Glu Pro Leu Leu Leu Ala Glu Leu Lys Pro Gly Arg 
5 10 / 15 

CCC CAC CAG TTC GAC TGG AAG TCA AGC TGC GAG ACC TGG AGC GTG GCC 
Pro His Gin Phe Asp Trp Lys Ser Ser Cys Glot Thr Trp Ser Val Ala 
20 25 / 30 




G CGGAATGAAG 



TTC 


TCG 


CCA 


GAC 


GGT 


TCC 


TGG 


TTC 


GCC 


TGG' 


TCT 


CAA 


GGA 


CAC 


TGC 


GTG 


Phe 


Ser 


Pro 


Asp 


Gly 


Ser 


Trp 


Phe 


Ala 


Tr£ 


Ser 


Gin 


Gly 


His 


Cys 


Val 




35 










40 










45 










GTC 


AAG 


CTG 


GTC 


CCC 


TGG 


CCC 


TTA 


GAG, 


/gaa 


CAG 


TTC 


ATC 


CCT 


AAA 


GGA 


Val 


Lys 


Leu 


Val 


Pro 


Trp 


Pro 


Leu 


Gil/ 


Glu 


Gin 


Phe 


He 


Pro 


Lys 


Gly 



50 



55 



60 



65 



TTC GAA GCC AAG AGC CGA AGC AGC AAG AAT GAC CCA AAA GGA CGG GGC 

Phe Glu Ala Lys Ser Arg Ser Ser/Lys Asn Asp Pro Lys Gly Arg Gly 

70 / 75 80 

AGT CTG AAG GAG AAG ACG CTG G^C TGT GGC CAG ATT GTG TGG GGG CTG 

Ser Leu Lys Glu Lys Thr Leu A*sp Cys Gly Gin lie Val Trp Gly Leu 

85 / 90 95 

GCC TTC AGC CCG TGG CCC TCT/ CCA CCC AGC AGG AAA CTC TGG GCA CGT 

Ala Phe Ser Pro Trp Pro Ser Pro Pro Ser Arg Lys Leu Trp Ala Arg 

100 / 105 110 



CAC CAT CCC CAG GCG CCT GAT GTT TCT TGC CTG ATC CTG GCC ACA GGT 
His His Pro Gin Ala Pro Asp Val Ser Cys Leu lie Leu Ala Thr Gly 
115 7120 125 

CTC AAC GAT GGG CAG ATC AAG ATT TGG GAG GTA CAG ACA GGC CTC CTG 
Leu Asn Asp Gly Gin lie Lys He Trp Glu Val Gin Thr Gly Leu Leu 
130 165 140 145 

CTT CTG AAT CTT TCT IGGC CAC CAA GAC GTC GTG AGA GAT CTG AGC TTC 
Leu Leu Asn Leu Ser/ Gly His Gin Asp Val Val Arg Asp Leu Ser Phe 
150 155 160 



ACG CCC AGC GGC AGT TTG ATT TTG GTC TCT GCA TCC CGG GAT AAG ACA 
Thr Pro Ser Gly S.er Leu He Leu Val Ser Ala Ser Arg Asp Lys Thr 
165 I 170 175 



60 
118 

166 

214 

262 

310 

358 

406 

454 

502 

550 

598 

646 
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CTT CGA ATT TGG GAC CTG AAT AAA CAC GGT AAG CAG ATC CAG GTG TTA 

Leu Arg lie Trp Asp Leu Asn Lys His Gly Lys Gin Lie Gin Val Leu 
180 185 yi90 

TCC GGC CAT CTG CAG TGG GTT TAG TGC TGC TCC ATC TCC CCT GAC TGT 

Ser Gly His Leu Gin Trp Val Tyr Cys Cys Ser lie Ser Pro Asp Cys 
195 200 /205 

AGC ATG CTG TGC TCT GCA GCT GGG GAG AAG TGfc GTC TTT CTG TGG AGC 

Ser Met Leu Cys Ser Ala Ala Gly Glu Lys ser Val Phe Leu Trp Ser 

210 215 /220 225 




694 



742 



790 



ATG CGG TCC TAC ACA CTA ATC CGG AAA Cjh GAA GGC CAC CAA AGC AGT 
Met Arg Ser Tyr Thr Leu lie Arg Lys Leu Glu Gly His Gin Ser Ser 
230 £35 240 



838 



GTT GTC TCC TGT GAT TTC TCT CCT GASf TCA GCC TTG CTT GTC ACA GCT 
Val Val Ser Cys Asp Phe Ser Pro Agp Ser Ala Leu Leu Val Thr Ala 
245 2^50 255 



886 



TCG TAT GAC ACC AGT GTG ATT AT<y TGG GAC CCC TAC ACC GGC GCG AGG 
Ser Tyr Asp Thr Ser Val lie Met Trp Asp Pro Tyr Thr Gly Ala Arg 
260 265 270 



934 



CTG AGG TCA CTT CAT CAC ACA yCAA CTT GAA CCC ACC ATG GAT GAC AGT 
Leu Arg Ser Leu His His Thr/Gln Leu Glu Pro Thr Met Asp Asp Ser 
275 280 285 



GAC GTC CAC ATG AGC TCC 
Asp Val His Met Ser Ser 
290 295 



TTG TAT CTC GCT ACG GT 
Leu Tyr Leu Ala Thr V 
310 



eu 




G AGG TCC GTG TGC TTC TCA CCT GAA GGC 
Arg Ser Val Cys Phe Ser Pro Glu Gly 
300 305 



GCA GAT GAC AGG CTG CTC AGG ATC TGG GCT 
Ala Asp Asp Arg Leu Leu Arg lie Trp Ala 
315 320 



CTG GAA CTG AAG GCT CCG GTT GCC TTT GCT CCG ATG ACC AAT GGT CTT 
Leu Glu Leu Lys Ala /Pro Val Ala Phe Ala Pro Met Thr Asn Gly Leu 
325 / 330 335 

TGC TGC ACG TTC TTC CCA CAC GGT GGA ATT ATT GCC ACA GGG ACG AGA 
Cys Cys Thr Phe Phe Pro His Gly Gly lie lie Ala Thr Gly Thr Arg 
340 / 345 350 



982 



1030 



1078 



1126 



1174 



GAT GGC CAT GTC/CAG TTC TGG ACA GCT CCC CGG GTC CTG TCC TCA CTG 1222 
Asp Gly His Va]/Gln Phe Trp Thr Ala Pro Arg Val Leu Ser Ser Leu 
355 / 360 365 

AAG CAC TTA TGC AGG AAA GCC CTC CGA AGT TTC CTG ACA ACG TAT CAA 1270 
Lys His Leu dys Arg Lys Ala Leu Arg Ser Phe Leu Thr Thr Tyr Gin 
370 / 375 380 385 
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GTC CTA GCA CTG CCA ATC CCC AAG AAG ATG AAA GAG T^C CTC AC A TAC 1318 

Val Leu Ala Leu Pro lie Pro Lys Lys Met Lys Glu JPhe Leu Thr Tyr 
390 395 / 400 

AGG ACT TTC TAGCAGTGCC GGCTCCCCCA CCTCCTGCAG ^AGCAGCAGT 1367 
Arg Thr Phe 

405 

ACAAGGGACT GGCTAGGATG GAGTCAGGCA GCTCACACT0 GACCAGTGTG GACCTTCCTT 1427 

CCTCCCATGG CATGTGCAAG TAGGTCTGCG TGACCCCACT TCTGTGGTGC CGGCCTTACC 1487 

TCGTCTTCAT CCGTGGTGAG CAGCCTTCGT CAGTCT#GTT GTGTTGAAGC CAAGTGCAGT 1547 

TGTGGATGTT GCTGGGGTAA TAAAGGCAAG CGGGC^CCAG AGCCTCTCTG GTGGCGGCCA 1607 

AGCCACACTC CCTTAACTGG GAAGTACCTG CCAQTGTAGGG CATTTCTGCT GCCTATTTCC 1667 

AGCCAGCGGC TGCATGGTTT GAAGTTCCTC CG^TGTGGTC AGAAGAACTC TGGTGTTTGG 1727 

TTCCCTGCTC AGCTGCGCGT GGACTGGGCT QAGCTCCTCA CCATACACTA GTGCCGGCTT 1787 

TTGTTTCCTG TAAACAGTGG TTGCATGTGT /AGAGAAGTAA CAAGCGAGTA TTCAGATCAT 1847 

ACGAGGAGGC GTTCCTCGGT GCATGACGG? CAGATGGCCA TTTATCAGCA TATTTATTTG 1907 

TATTTTCTCA GCACATAGTA AGGTACAi^CT GTGTTTTCTC AATTGTCTCG AAAAAACAGA 1967 

GTTCTTAAGT GGCCCAGTTG TGGAGCQ&AG TCTAAGTCGT GTGGAGTCAG TGCTGACATC 2027 

ACTGGCTTGT GCTGTCTGTC ACATGfGTTT GTCTCTGCTG CTTGACCTCA TGGGATGTAC 2087 

CCTCCAGTTC AACTGCCCAA AACAgACAGC CCCTTCCAAG CACCGTTCTT TGACAGCGGT 2147 

AGCAGCTACC TATTCAAGAC GCCfCACACA AAATCTGCCT TAGAAAGTTA ATATATTTTA 2207 

AATTATTTTA AAAGAAACTC AA£ATCTTAT TCTTTGGCCT TTCTTAATTG ATGCTTTATG 2267 

GAGGCAGTGT TAACATTGTA CAGTGTATGC ATAGAGGAGT CTCCTCTATT TGAAGAACAA 2327 

TGCAAAATGA GGCTTTCATT fiAAGGGAAAA AAAAAAAAAA AA 2369 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE ^CHARACTERISTICS : 

(A) LEMGTH: 404 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECUKE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 




Met Glu Ala Gly Glu Glu Pro 
1 5 
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Leu Leu Leu Ala Glu Led Lys Pro Gly 
10 / 15 



Arg Pro His Gin Phe Asp Trp 
20 



Lys Ser Ser Cys Glu/Thr Trp Ser Val 
25 / 30 



Ala Phe Ser Pro Asp Gly Ser 
35 



Trp Phe Ala Trp Ser Gin Gly His Cys 
40 / 45 



Val Val Lys Leu Val Pro Trp 
50 55 



Pro Leu Glu Gl/i Gin Phe lie Pro Lys 

60 



Gly Phe Glu Ala Lys Ser Arg 
65 70 



Ser Ser Lys ^.sn Asp Pro Lys Gly Arg 
75 80 



Gly Ser Leu Lys Glu Lys Thr 

85 



Leu Asp Cys Gly Gin lie Val Trp Gly 
r 90 95 



Leu Ala Phe Ser Pro Trp Pro 
100 



Ser Pro/ Pro Ser Arg Lys Leu Trp Ala 

110 



Arg His His Pro Gin Ala Pro 
115 



Asp yal Ser Cys Leu lie Leu Ala Thr 
120 / 125 



Gly Leu Asn Asp Gly Gin lie 
130 135 



Ly§f lie Trp Glu yal Gin Thr Gly Leu 
140 



Leu Leu Leu Asn Leu Ser Gly 
145 150 



lis Gin Asp Val Val Arg Asp Leu Ser 
155 160 



Phe Thr Pro Ser Gly Ser Lei 
165 



lie Leu Val Ser Ala Ser Arg Asp Lys 
170 175 



Thr Leu Arg lie Trp Asp l/eu 
180 



Asn Lys His Gly Lys Gin lie Gin Val 
185 190 



Leu Ser Gly His Leu Gin/ Trp 
195 



Val Tyr Cys Cys Ser lie Ser Pro Asp 
200 205 



Cys Ser Met Leu Cys Ser Ala 
210 / 215 



Ala Gly Glu Lys Ser Val Phe Leu Trp 
220 



Ser Met Arg Ser Tyr /Thr Leu 
225 /230 



lie Arg Lys Leu Glu Gly His Gin Ser 
235 240 



Ser Val Val Ser Cyys Asp Phe 
2 



Ser Pro Asp Ser Ala Leu Leu Val Thr 
250 255 



Ala Ser Tyr Asp jhr Ser Val 
260 



lie Met Trp Asp Pro Tyr Thr Gly Ala 
265 270 



Arg Leu Arg Ser Leu His His 
275 

Ser Asp Val His Met Ser Ser 
290 295 

Gly Leu Tyr Leu Ala Thr Val 
305 310 

Ala Leu Glu Leu Lys Ala Pro 
325 

Leu Cys Cys Thr Phe Phe Pro 
340 

Arg Asp Gly His Val Gin Phe 
355 

Leu Lys His Leu Cys Arg Lys 
370 375 

Gin Val Leu Ala Leu Pro lie 
385 390 

Tyr Arg Thr Phe 
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Thr Gin Leu Glu Pro Ttu/ Met Asp Asp 
280 21 

Leu Arg Ser Val Cys /he Ser Pro Glu 
300> 

Ala Asp Asp Arg Leu Leu Arg lie Trp 
315 / 320 

Val Ala Phe Ala/ Pro Met Thr Asn Gly 
330 / 335 

His Gly Gly lie He Ala Thr Gly Thr 
345 / 350 

Trp Thr Ali Pro Arg Val Leu Ser Ser 
360 / 365 

Ala Leu /rg Ser Phe Leu Thr Thr Tyr 
380 

Pro Ly/s Lys Met Lys Glu Phe Leu Thr 
395 400 



(2) INFORMATION FOR SEQ ID NO: £2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1246 bise pairs 

(B) TYPE: nucleic yacid 

(C) STRANDEDNESS:/ single 

(D) TOPOLOGY: lirfear 

(ii) MOLECULE TYPE: DMA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
GACACTGCAT CGTCAAACTG AtfCCCCTGGC CGTTGGAGGA GCAGTTCATC 
TTGAAGCCAA AAGCCGAAGT AGCAAAAATG AGACGAAAGG GCGGGGCAGC 
AGACGCTGGA CTGTGGTCAg/aTTGTCTGGG GGCTGGCCTT CAGCCTGTGC 
CAGCAGGAAG CTCTGGGCA£ GCCACCACCC CCAAGTGCCC GATGTCTCTT 
TGCTACGGGA CTCAACGA^G GGCAGATCAA GATCTGGGAG GTGCAGACAG 
TTTGAATCTT TCCGGCCACC AAGATGTCGT GAGAGATCTG AGCTTCACAC 
TTTGATTTTG GTCTCCffiCGT CACGGGATAA GACTCTTCGC ATCTGGGACC 



CCTAAAGGGT 60 

CCAAAAGAGA 120 

TTTCCCCACC 180 

GCCTGGTTCT 240 

GGCTCCTGCT 300 

CCAGTGGCAG 360 

TGAATAAACA 420 



- 141 - 



CGGTAAACAG 


ATTCAAGTGT 


TATCGGGCCA 


CCTGCAGTGG 


GTTTAOTGCT 


GTTCCATCTC 


480 


CCCAGACTGC 


AGCATGCTGT 


GCTCTGCAGC 


TGGAGAGAAG 


TCGGJTCTTTC 


TATGGAGCAT 


540 


GAGGTCCTAC 


ACGTTAATTC 


GGAAGCTAGA 


GGGCCATCAA 


AGQfAGTGTTG 


TCTCTTGTGA 


600 


CTTCTCCCCC 


GACTCTGCCC 


TGCTTGTCAC 


GGCTTCTTAC 


GATACCAATG 


TGATTATGTG 


660 


GGACCCCTAC 


ACCGGCGAAA 


GGCTGAGGTC 


ACTCCACCACy 


/ACCCAGGTTG 


ACCCCGCCAT 


720 


GGATGACAGT 


GACGTCCACA 


TTAGCTCACT 


GAGATCTGTG 


TGCTTCTCTC 


CAGAAGGCTT 


780 


GTACCTTGCC 


ACGGTGGCAG 


ATGACAGACT 


CCTCAGGATC 


TGGGCCCTGG 


AACTGAAAAC 


840 


TCCCATTGCA 


TTTGCTCCTA 


TGACCAATGG 


GCTTTGOTGG 


CACATTTTTT 


CCACATGGTG 


900 


GAGTCATTGC 


CACAGGGACA 


AGAGATGGCC 


ACGTC^AGTT 


CTGGACAGCT 


CCTAGGGTCC 


960 


TGTCCTCACT 


TV TV /"I /"^ TV 1 TV H TV 

GAAGCACTTA 


m/^/"!/"^/*^/*^ TV TV TV o 

TGC CGG AAAG 


CCCTTCGAAG 


1 1 utm i lorim TV TV /™i TV 

TTTCCTAACA 


tv /tmm tv tv tv /~i 

ACTTACCAAG 




TCCTAGCACT 


GCCAATCCCC 


AAGAAAATGA 


AAGAGTTCCT 


CACATACAGG 


ACTTTTTAAG 


1080 


CAACACCACA 


TCTTGTGCTT 


CTTTGTAGCA 


GQGTAAATCG 


TCCTGTCAAA 


GGGAGTTGCT 


1140 


GGAATAATGG 


GCCAAACATC 


TGGTCTTGCA 


OTGAAATAGC 


ATTTCTTTGG 


GATTGTGAAT 


1200 


AGAATGTAGC 


AAAACCAGAT 


TCCAGTGTAC 


/TAGTCATGGA 


TTTTTC 




1246 



(2) INFORMATION FOR SEQ ID NO: 2 3/: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 base /pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: lineaf 

(ii) MOLECULE TYPE: DNA / 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

ACCATGGTTC CAAGTCCTCT CCCOTGTGGT CAAGTTGCCC GAATGTTGGG CCCAAGTGCC 60 

TTTTCCTCCT TGGGCCTCCC CTTOTGACCT GCAGGACAGT TTTCCGGAGC CCATTTGGTA 120 

TGAGGTATTA ATTAGCCTTA ACTAAATTAC AGGGGACTCA GAGGCCGTGC TCCTGACCGA 180 

TCCAGACACT ATTTTTTTTT TTITTTTTTTA ACAATGGTGT GCATGTGCAG GAAATGACAA 240 

ATTTGTATGT CAGATTATAC AAGGATGTAT TCTTAAACCG CATGACTATT CAGATGGCTA 300 

CTGAGTTATC AGTGGC C ATT TATTAGCATC ATATTTATTT GTATTTTCTC AACAGATGTT 360 



• 
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AAGGTACAAC TGTGTTTTTC TCGATTATCT AAAAACCATA GTACTT/ 
AA 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2019 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2> 
GGCACGAGGC GGGGTCAGGG CGGAGGCTGA GGACCAAC5TA GGCATGGCGG 
CGGCCCCGAT GGACGGGCCG GCCCGGGACC CGCAGGTCCT AATCTGAAGG 
GGAGCAGTTC TGTGACCATC CACTGGAGCA CTGT^ACGAT ACAAGACTCC 
CTATGTAGGG GACCTCCAGA CCCTCAGGAA CCTACTGCAA GAGGAGAGCT 
CATCAATGAG AAGTCTGTCT GGTGCTGCGG CTGGCTTCCC TGCACACCAC 
AGCCACTGCA GGCCATGGGA ACTGTGTGGA CETTCCTCATA CGCAAAGGGG 
CCTGGTGGAT GTCAAGGGGC AGACTGCCCt/gTATGTGGCT GTAGTGAACG 
GAGCACTGAG ATCCTTTTGG AAGCTGGTGjC TGATCCCAAC GGCAGCCGGC 
CACTCCTGTG TACCATGCCT YTCGTGTQGG TAGGGACGAC ATCCTGAAGG 
GTATGGGGCA GATGTTGATG TCAACC^TCA TCTGAATTCT GACACCCGGC 
ACGGCGGCTA ACCTCCTTGG TGGTCTGTCC TCTATACATC AGTGCTGCCT 
TCAGTGCTTC AGGCTGCTCT TGCAJ&GCTGG GGCAAATCCT GACTTCAATT 
TGTCAACACC CAGGAGTTCT ACi^GGGATC CCCTGGGTGT GTCATGGATG 
CCATGGCTGT GAAGCAGCCT TGGTGAGTCT GTTGGTAGAG TTTGGAGCCA 
GGTGAAGTGG GAATCCCTGG QCCCAGAGGC AAGAGGCAGA AGAAAGATGG 
CTTGCAGGTC TTTAAAGAGG /CCAGAAGTAT TCCCAGGACC TTGCTGAGTT 
GGCTGTGAGA AGAGCTCTTOS GCAAATACCG ACTGCATCTG GTTCCCTCGC 
AGACCCCATA AAGAAGTT2TT TGCTTTATGA GTAGCATTCA CATGCAGTGC 



^T TGAAAAAAAA 



420 
422 



AGGGCGGGAC 
AGTGGCTGAG 
ATGATGCAGC 
ACCGGAGCCG 
TGAGGATCGC 
CCGAGGTGGA 
GGCACTTGGA 
ACCACCGCAG 
CTCTTATCAG 
CCCCTTTTTC 
ACCATAACCT 
GCAATGGCCC 
CTGTCCTGCG 
ACCTGAACCT 
ATCCTGAGGC 
TGTGCCGGGT 
TGCCGCTGCC 
TGACTGCAAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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GTGGAAGCCG ATCACCTGCA GTGAAAACTG ACACAGACTC TGGCATCCjafe GGAACCATGG 1140 

CCTGTGCTGC CAGCTTGATC CTTGGCTGTC AGTGAAGAAA AAACGGJSTGT GTTCTCTTGG 1200 

ACTGTGATTC TATCTCAGGT GCTTGGGCCA TCGAACGCTC CTTG^GTCAT TGTCAACTGA 1260 

GAGGCACATA CAAACTTAAT TTTGTTCCTC TTCAGTCTCT CT^TTTTGGA TTCTTCCTGG 1320 

CAATGTGTGC AGCATGGGCT GAGCCTGGTG ATTGCCCTAG T@GGGAAGGC TTTTTTCTCC 1380 

AGGCTATGCA TCTATTTATG TTCCTACTTT GCAATTTATT yGTTCTTTTAA GGCTTGATAT 1440 

CAAAACAGAA AGAGGTTTGT TAAGAAAAGA TATAGGGAGA' AAGGAATTCC GGTTCCGTGC 1500 

/ 

ACTTGCTAGC CTGCTTTCCT TGCCTGGGTT TGTCTGTCTA TGCTGCCTGG TGCACATCCC 1560 

TTCTCTTTGC TGCCACTGTT CTATTTTGGG AGTTGTCTTC CGTCTAAGAT GGCTTCTGGG 1620 

GTTCTATCTT ATTGCACAGA GGTCCCAGAA CAGTGTTCAT AGGGCACCAT CTGCTCTGCC 1680 



AAGGGTTTTC TGATGTCTTA CCCTGGGGAT CTTGAGACAG TGGTTACCTT TAGGAGACCC 1740 

ACCTGGAACT AACCATTAAG TGACTGCCCA CATTCAGATC AGGGACCATC TTAATAGTAC 1800 

/ 

TCACTGCCAG TCCTCACAAG AGAAGATGAC ACGGGTGCTC TCTTCAGACA CTCCCATACA 1860 

GGAAGTTGGA AAATGTCTTG GTCACCTGGG/tTGTTCCCAG GCTACAACTT CTTGGTGTTC 1920 

CACTAARACC AGRATATCCT AGTTTTTTGG GTTGACTGTT CCCTCCCCAC TTTCCTTGAA 1980 

NCCCAATGCC CNTTTGTKTN GGTTGCTTCC CTAAAAKTT 2019 



(2) INFORMATION FOR SEQ ID NO: 25: 

/ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 350 amino acids 

(B) TYPE: amino yacid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: /DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



Ala Arg Gly Gly Val Arg Ala Glu Ala Glu Asp Gin Val Gly Met Ala 
1 k 10 15 

Glu Gly Gly Tha? Gly Pro Asp Gly Arg Ala Gly Pro Gly Pro Ala Gly 
20/ 25 30 

Pro Asn Leu Lys Glu Trp Leu Arg Glu Gin Phe Cys Asp His Pro Leu 
35 / 40 45 



m 
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Glu His Cys Asp Asp Thr Arg Leu His Asp Ala Ala Tyr /Val Gly Asp 
50 55 60 

Leu Gin Thr Leu Arg Asn Leu Leu Gin Glu Glu Ser 2fyr Arg Ser Arg 
65 70 75 / 80 

lie Asn Glu Lys Ser Val Trp Cys Cys Gly Trp Lieu Pro Cys Thr Pro 

85 90 / 95 

Leu Arg lie Ala Ala Thr Ala Gly His Gly Assf Cys Val Asp Phe Leu 
100 105 / 110 

lie Arg Lys Gly Ala Glu Val Asp Leu Val Asp Val Lys Gly Gin Thr 
115 120 / 125 

Ala Leu Tyr Val Ala Val Val Asn Gly h/s Leu Glu Ser Thr Glu lie 
130 135 / 140 

Leu Leu Glu Ala Gly Ala Asp Pro Asri Gly Ser Arg His His Arg Ser 
145 150 / 155 160 

Thr Pro Val Tyr His Ala Xaa Arg /Val Gly Arg Asp Asp lie Leu Lys 

165 / 170 175 

Ala Leu lie Arg Tyr Gly Ala Asp Val Asp Val Asn His His Leu Asn 
180 / 185 190 

Ser Asp Thr Arg Pro Pro Phe' Ser Arg Arg Leu Thr Ser Leu Val Val 
195 / 200 205 

Cys Pro Leu Tyr lie Ser Ala Tyr His Asn Leu Gin Cys Phe Arg 
210 /215 220 

Leu Leu Leu Gin Ala Gly Ala Asn Pro Asp Phe Asn Cys Asn Gly Pro 
225 2^0 235 240 

Val Asn Thr Gin Glu/phe Tyr Arg Gly Ser Pro Gly Cys Val Met Asp 

245' 250 255 

Ala Val Leu Arg h/s Gly Cys Glu Ala Ala Phe Val Ser Leu Leu Val 
260 / 265 270 

Glu Phe Gly Ala/ Asn Leu Asn Leu Val Lys Trp Glu Ser Leu Gly Pro 
275 / 280 285 

Glu Ala Arg dly Arg Arg Lys Met Asp Pro Glu Ala Leu Gin Val Phe 
290 / 295 300 

Lys Glu Aid Arg Ser lie Pro Arg Thr Leu Leu Ser Leu Cys Arg Val 
305 / 310 315 320 



• 




Ala Val Arg Arg Ala Leu Gly Lys Tyr Arg Leu His L/Su Val Pro Ser 

325 330 / 335 

Leu Pro Leu Pro Asp Pro lie Lys Lys Phe Leu Leu Tyr Glu 
340 345 / 350 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID 

GCATCCATGG CGGAGGGCGG CAGCACGACG GGC^GGCAGG GCCGGGCTCC GCAGGTCGTA 60 

ATCTGAAGGA GTGGCTGAGG GAGCAATTTT GTGATCATCC GCTGGAGCAC TGTGAGGACA 120 

CGAGGCTCCA TGATGCAGCT TACGTCGGGG yACCTCCAGAC CCTCAGGAGC CTATTGCAAG 180 

AGGAGAGCTA CCGGAGCCGC ATCAACGAg/ AGTCTGTCTG GTGCTGTGGC TGGCTCCCCT 240 

GCACACCGTT GCGAATCGCG GCCACTGCAG GCCATGGGAG CTGTGTGGAC TTCCTCATCC 300 

GGAAGGGGGC CGAGGTGGAT CTGGTGGACG TAAAAGGACA GACGGCCCTG TATGTGGCTG 360 

TGGTGAACGG GCACCTAGAG AGTACCCAGA TCCTTCTCGA AGCTGGCGCG GACCCCAAC 419 
( 2 ) INFORMATION FOR SEQ ID/ NO : 27 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59y5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY/: linear 

(ii) MOLECULE TYP/E : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

GAGGAAGAAG AAAAGTGGAC CCTGAGGCCT TGCAGGTCTT TAAAGAGGCC AGAAGTGTTC 60 

CCAGAACCTT GCTGTGTCTG TGCCGTGTGG CTGTGAGAAG AGCTCTTGGC AAAACCGGCT 120 

TCATCTGATT CCTTGGCTGC CTCTGCCAGA CCCCATAAAG AAGTTTCTAC TCCATGAGTA 180 

GACTCCAAGT GCTSCGGTTG ATTCCAGTGA GGGAGAAAGT GATCTGCAGG GAGGTGGACA 240 







• 




W , 
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CCGAGCCCTG 


AGTGCTGTGC 


TGCTGCTGGT 


CTCCTGATGG 


CTGTTGCTGC AGAAGATGTC 


300 


CTCGTAGACT 


GTCATTGCTC 


CTCAGGTGCC 


TGGGCCGCTG 


AACAGTCCTT GGGTCATTGT 


360 


CAGCTGAGAG 


GCTTATACTA 


AAGTTATTAT 


TGTTTTTCCC 


AAG^TCTCTG TTCTGGATTT 


420 


TCAGTTGCAT 


ATTAATGTAA 


CGGGCCATGG 


GGTATGTACA 


TGTAGGGGCT GAGGTTGGAG 


480 


GCCTACTAAT 


TTCCTGTAGG 


GAAGACTCCC 


AGCACTTCTG 


iGAACTGTGCT TCTCTTTATT 


540 


TTTCTACTTC 


TCAATTTGAT 


GGTTCGATTA 


AAGCCTTCTA 


GTATCTCAAT GAAAA 


595 


(2) INFORMATION FOR SEQ ID NO: 28: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 896 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 4.. 396 

(xi) SEQUENCE DESCRIPTION/ SEQ ID NO: 28; 



CTG ATG TCC GCA ATT CTG AAG/GTT GGA CAC CAC 
Met Ser Ala He Leu Lys/ Val Gly His His 
1 5 / 10 



TGC TGG CTG CCT GTG 
Cys Trp Leu Pro Val 

15 



48 



ACA TCC GCT GTC AAT CCC Q&A AGG ATG CTG AGG 
Thr Ser Ala Val Asn Pro Gin Arg Met Leu Arg 

20 / 25 



CCA CCA CCA ACC GCT 
Pro Pro Pro Thr Ala 
30 



96 



GTT TTC AAC TGT GCC GOT TGC TGC TGT CTG TGG 
Val Phe Asn Cys Ala Ala Cys Cys Cys Leu Trp 
35 / 40 



GGG CAG ATG CTG ATG 
Gly Gin Met Leu Met 
45 



144 



AAT ACA TAC CGT GTA /GTT CAG CTT CCT GAG GAG 
Asn Thr Tyr Arg Val/ Val Gin Leu Pro Glu Glu 
50 / 55 



GCC AAG GGC TTG GTG 
Ala Lys Gly Leu Val 
60 



192 



CCA CCA GAG ATT CTA CAG AAG TAC CAT GGA TTC 
Pro Pro Glu He Lfeu Gin Lys Tyr His Gly Phe 
65 / 70 



TAC TCT TCC CTC TTT 
Tyr Ser Ser Leu Phe 
75 



240 



GCC TTG GTG AGGr CAG CCC AGG TCG CTG CAG CAT 
Ala Leu Val Arg Gin Pro Arg Ser Leu Gin His 
80 / 85 90 



CTC TGC CGT TGT GCG 
Leu Cys Arg Cys Ala 

95 



288 




Mr*-* 
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CTC CGC AGT CAC CTG GAG GGC TGT CTG CCC CAT GCA CTA CCG JtGC CTT 336 
Leu Arg Ser His Leu Glu Gly Cys Leu Pro His Ala Leu Prc^Arg Leu 
100 105 / 110 

CCC CTG CCA CCG CGC ATG CTC CGC TTT CTG CAG CTG GAC tfttT GAG GAT 384 
Pro Leu Pro Pro Arg Met Leu Arg Phe Leu Gin Leu Asp/Phe Glu Asp 
115 120 / 125 

CTG CTC TAC TAGGCTTGCT GCCCTGTGAA CAAAGCAGAC CCC^fcCCCCA 433 
Leu Leu Tyr 
130 

CCCCAAGGGC ATCTCTCAGC AATGAATGAT GCAAGGCGGT CTGTCTTCAA GTCAGGAGTG 493 

/ 

GACGCCTTGA TCCACACTTG AGAGAAGAGG CCAGATCAGC ^ACCYGGCTGG TAGTGATNGC 553 

AGAGGGCACC TGTGCAGATC TGTGTGCGCA CTGGAAATCT CTAGGCTGAA GGCYAGAGCA 613 

AATGGTGCAR GTGTTAGTCC TTGGGANGAG AGACAGAJsfGG TGAGAAAGCA AGACAGAGGT 673 

GAGAGTGCAC ATGTCAAGTG GTAGATTGCC TTAAAAGAAA GCTAAAAAAA GAAAAAGATT 733 

CGGGCGAACT TCTTTAGGGG TAATGCTGCA GCGT^TTAAA CTGACTGACC AGCGTCCATA 793 

TCTTTGGACC CTTCCCGGGT GAAAAAGCCC CTT<£ATCCTC CAGCGCTCCC CAAGGGTGCT 853 

TAGCAATACC GGGTGCTTTT CTGCCGCAAA GTJGAGTTACC AAA 896 
(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino aa'id 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: pro/tein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Ser Ala He Leu Lys Va/ Gly His His Cys Trp Leu Pro Val Thr 
1 5/10 15 

Ser Ala Val Asn Pro Gin dxg Met Leu Arg Pro Pro Pro Thr Ala Val 
20 / 25 30 

Phe Asn Cys Ala Ala Cys/ Cys Cys Leu Trp Gly Gin Met Leu Met Asn 
35 J 40 45 

Thr Tyr Arg Val Val din Leu Pro Glu Glu Ala Lys Gly Leu Val Pro 
50 M 55 60 



Pro Glu lie Leu Gin Lys Tyr His Gly Phe Tyr 
65 70 75 

Leu Val Arg Gin Pro Arg Ser Leu Gin His Leu 

85 90 

Arg Ser His Leu Glu Gly Cys Leu Pro His Ala 
100 105 

Leu Pro Pro Arg Met Leu Arg Phe Leu Gin Leu 
115 120 

Leu Tyr 
130 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 30 
GTGGGGGCGT CATCATGACC TCCTCTAGGG CTCTGCAACA 
AACAAATTGT TCACTGATGA ATCCACAAGG ATCTCTGGGC 
ACATGACTGT CGTCTTCGGA GAAGGCACCA A^TCGCCCCCG 
CATGGGAGAA GACGTATCCA GGCAGCAGCT GCGCGGCCCT 
ATCTAAAGGC ACGGTGTACT GAAGGTAGJTC CTGAGACATG 
GTGTTCCTCC AGGTGGAGGC TCAGGTGCCC GGGTGAGCTG 
GCGCGGCTCT GGCTGCAGGT CTCGC3K3CTC CCTGGGCTGT 
CACACCGTTG ACTGGT 

(2) INFORMATION FOR SEQ If NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22/80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY/ linear 

(ii) MOLECULE TYPE: DNA 



Ser Ser Leu /Phe Ala 

80 

Cys Arg C^s Ala Leu 
95 

Leu Pro' Arg Leu Pro 
110 

Asp P'he Glu Asp Leu 
25 



TGACTCCTGT 
CTACAACCAG 
GCAGGTACGG 
TCAAGAGGGC 
AGTCCGATTA 
GGGCTGCAGC 
AGCTCCCGCA 



GGTGCAAATC 60 

GTCCTGGTCC 120 

CTGACACCTC 180 

ACATCCCGTC 240 

CTACAGGCAC 300 

GGGACTCAGG 360 

GATCCTTGCG 420 
436 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: /, 

TTAATAGTAC CTACATAGTA GAAAATTATA ACTCCACTTT AAAAC AATGT /TTTCTTTCTA 60 

TTCAAATCAA TTTAAAACTT TTTATAAACA TTAATGTTGC AAGAGAATQC AGTCCATTTA 120 

TGAAAATTAG TTGACAATCA AGTTCACCCA AGAAAATGTT GACTAAGOTA AAGAAATCAC 180 

AGATAAAACA TTTTACCAAA AGGATAGGTA ACACACAAAA AAATGOTATC ACAGGAAGCT 240 

ATGATCATCT AATATTTCTT TAATAATAAT TCTAGTTCCA TAGG^TTTCA TGTTATGCCA 300 

ATTTGTACCC GAGTTTAATT ACAGAAAAGG CAACAATTTC TAAATTGGTG GTATACATTT 3 60 

CTTTACAATT TTTTAATGTA AGGCCATTTA TTAAAATAGA CAAACTAGAA GATGAAAACG 420 

AAGGCAACAG AAAAATTCAA CTTTTCACAA CCAAAAGAAT/TAGCACAACC TTAGAAATAA 480 

TTTAGAAAAA AGTGTTGTTA AAAGATATGT TGCAGATCTC CGTTCCATTA CCCAAGATTA 540 

TGTCAATTCA CGATTCTAAA TAAATCTTTT TAAAGT^GA GATTAAAAAC TCATCTTCAG 600 

TGTATATGTA AATTCCGTGG TTTTATCACA CAGGTAffGTT TATTCAACAC TGCTTTGGAA 660 

ATGGACCATT TAAAAGGACA TGGCAATTTC CATTCTGTTA AGTTTCATTC AACCTTTACT 720 

TAGGGGTTGA TTACCACATG AAATGTGCTT TTA&TGCATA AAAATCACAG TGGATTAGCC 780 

AGCAAAAGGG ACTGGGCGGG GGGGGCATTG AGK3AGAATTT GATAATTCAC ATTGTGATTA 840 

TTCTGCACAT TGATGAAACA TAATTCACAC JCTCTAAAACC TCAAGACTTC CCTTTTTTAA 900 

AGAACCAAAA TAAACCCAAG ACACCTTGCt/ GACACTTCCC CACCCCTAAA CAAACTGATG 960 

ACTCTTTTAC ACATAAAACT GAAATAGTTA TGGCAGCAAA AGATTTTGAT GGCAATGAAA 1020 

GTTTGTAAAC TGTATTTCAA TCTCTTOTTC TTATTCCCAA AGTGCAAGAT GCAGGGTTCT 1080 

CAATCTTTCA GTAGTGCTTC TCCTG'BAAAT AATCCTTCAT TTTGTTTGGC AAAGGCAGTT 1140 

TCTGAATTAA GTCTATTCTG GTATACTGAC GTATAACAAA ACGACACAGG TACTGCAACG 12 00 

AGCGCACCTA TGAACCCCGG AACACTGGTT GGCAAGTTCT GACGGAAGTG CAGATTCCAG 12 60 

GCAGCGAGAC CTTGAATAAC AAAAAGCTCC CATTTTCAGA GTCCCTGATT GAATGCTCCA 1320 

ATTAGATCAA CTATGGACGT ATCTCCTTCC ACATCGGCTG TTCATAAAAG CTAAACCTAC 1380 

CATTTGAGTG CTC AATTCTA /gTGTGAAGTG TTTTACCATG GGAGCGAAAG TCACAGCTTA 1440 

AAAGGTAACG GTCGTCAGAA CTGTCCCGAA CAAGAAAAGA ACCATCTGGC ACGTTTGCTA 1500 
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PpTTPPPTTP 


TPPPTPPPAA 


PPTPTPATTP 

\^\J X V7 X \Jc\ X X VJ 


PTPPPPAPTA 

VJ 1 V-V^-V* V-.riVj J. r\ 


PP ATPPTTPP/ 


TTTPP A APTT 
XXX VjV^ri/iVj X X 


iJDU 


TTTTP APPTP 
x x x x x \_ 


PTPTPTAAPP 


PTTPTPAPAA 


V-V^.tt.x VJVJVjri\_,V, 


APTAPTTTJPP 


APTPAPTPAT 
.riv_ X Vj/iVj X x 




AAAPTPTTPP 


AAPPPPAPOA 


pp AP APTTPP 


P ATP AAA ATT 


PAAATPAPAP 


PPP AT A APTT 
V-VjV-».ri X ririV- X X 


XDOU 


TPAPPPAPPT 


rzrcnnp ttt p t 


PTPP APTP AP 


TPPAPTPA A A 


PTTPf7*PPTTT 
Vjj x X V^^L-V^V- X X X 


PPP A TTTPP A 
VjVjVjri XXX VJVjri 




X XriX x\^\^ x\j\^ 


ATTPPAPT A A 


PPA ATPPTPA 
V^ V-r\r\ 1 1 \3r\ 


AP ATTPP APP 
r\Vjr\ 1 XoVjriVjVj 


PAP>ATPP ATP 


ptp a ^Hcnon 

VJ X VxciAV».\-V-.VjV^ 


XOUU 


mprpp p pppp t 


rppmpp7V Ap Am 


VjAL. 1 V^L,V_V3 1 V3 


prpppp a A rpp a 
Vj 1 VjV_V- nA 1 V_ A 


Af7 A APPP ATT 


p apppp aptp 

V-AUv^VjVjAv- X vjr 


1 Q£Pi 


ATPP APPA AH 


A TPTPTPPPP 
/\1L 1 vjVaVjVjr 


PPAPA APTAP 
V^Va/iV^ri/iV- x r\\j 


PTPPTPPTPT 
Vj 1 V-V^. x VjVj x V^ X 


Apptp aptpt 


P ATPPTPPPP 
V^-MX V^U X VjtVjVjj 




vj/i/i/iVjV.. VJ Vjr V- 


PPTPPPAPTT 


n a pp z\ an nap 

Vj/iVJVjAVjVj/irH.V^ 


PPPAP APAPT/ 
V^VJV^/iVjfT.Vjri.V^ Jf 


TPP ATPPPAP 

1 V-.V_/\ X VjVjVjiiVj 


a ZVPAPPTPTP 


J. _7 O VJ 


CAGACAATAG 


CTCCGTGATC 


CTTCCAAAGG 


ATAP ATPPPP 


TP ATPT A A AP 


PP APHPTZiTA 
Vj-V^AVwriVj X X iA 


Z VJ U 


CTGAATGTAG 


TCCTGAGGCA 


TAAGTCCAAT 


AACGACAGGC 


ACATGTTCAT 


CCAGGTGAAG 


2100 


ATGCAGGTCT 


CCATTATGAG 


AAGCCGAGCT 


CTTCAGTGAA 


TTGGCTTGCT 


CCTGGCACGT 


2160 


GGTCTCAGAC 


TGGAGGTCGT 










9 1 an 


(2) INFORMATION FOR SEQ ID NO: 32: 










(i) SEQUENCE CHARACTERISTICS/ 

(A) LENGTH: 2649 base p&irs 

(B) TYPE: nucleic acid/ 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 7 








(ii) MOLECULE TYPE: DNA / 










(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 








GGCACGAGGC 


TGTGTCCAGC 


ACACAdAGAG 


GGCCCGGCCA 


TipTipPT" i in ipn 
Itl VjU 111 VjVj 


TTPS.P A.PPPP 


An 
o u 


TGTGTCTGTC 


TGTCACTTAG 


ACTCTTCCTC 


CCGGCTCGCA 


ppmpAPPPTP 


p a TPPTPPTT 
C/iX V-V^ X CV- X X 




ACTGGCTCCA 


GCATGACTCG 


CTTCTCTTAT 


GCAGAGTACT 


TTPPTPTPTT 
1 x bL 1 V^ J. Vj 1 1 


x v_,i-v.v_ ILi vjVj-L, 


-LOU 


TCTGCACCTT 


CCAGGTCCCC 


TTCGTCTCCC 


GAGAACCCAC 


CGGCCCGCGC 


ACCCCTGGGT 


240 


CTGTTCCAAG 


GGGTCATGCA 


GAAGTATAGC 


AGCAACCTGT 


TCAAGACCTC 


CCAGATGGCG 


300 


GCTATGGACC 


CCGTGCTGAA 


/GGCCATCAAG 


GAAGGGGATG 


AAGAGGCCTT 


GAAGATCATG 


360 


ATCCAGGATG 


GGAAGAATCy 


TGCAGAGCCC 


AACAAGGAGG 


GCTGGCTGCC 


GCTCCACGAG 


420 


GCTGCCTACT 


ATGGCCAGGT 


GGGCTGCCTG 


AAAGTCCTGC 


AGCAAGCCTA 


CCCAGGGACC 


480 



ATTGACCAAC 
CTGGATTGCC 
AGGGAGACTC 
GTGCGATACA 
TCTGTCTCCC 
GAGGCCAAGA 
GAGGCCCTGA 
GCATCAGCCC 
TCTCAGGGCG 
TCCAAGAAGG 
GTGCGCCGTA 
CTGGAGGCGC 
CGCCTCTACG 
TACGCCACCG 
CTGCTCGTGG 
GCCAACATCG 
GCCATGAAGT 
TGCTTCTCCT 
ACGACGCACC 
CCCCGGAAGT 
ACGTGCAGCT 
TCAAGGAGAA 
AGGCCATAGG 
TCAGATACTT 
GACTGTTTCC 
GTGACCTTGC 



GCACACTGCA 
TCCTGTCGCT 
CACTTTACAA 
ACGCAGACGC 
GCAATGACCT 
ATGTCTACAG 
GGTTCCTGGC 
TCTACGAGGC 
CCGATGCTAA 
GCAACTATAG 
GCGGCATCAG 
TGCTGGCCGC 
AGGACCGCCG 
AGCTGTTGCT 
CCATCCGCCA 
ACGCCTACAT 
GCCTGTCGTT 
GCCTGTACGG 
CGTGGACGAC 
GAGCCGCTGG 
GTGCTCCCGG 
GGCAGAACCT; 
AAAATACCC 
GAAATATGAG 
GGGACGGCCC 
TGGTTQTTTG 



GGAAGAGACA 
GCTCCAGGCG 
AGCCTGTGAG 
CAACCACCGC 
GGAGGTCATG 
CATCACCCCT 
CAAGCATGGT 
CAGCAAGAAT 
CAAAGCCAAC 
AATAGTGCAG 
CCCGCTGCAC 
GCGCTTCGAC 
CAGTTCTGCG 
GCTGGCGGGC 
CGGCTGCCTG 
CGCCACIJCAC 
ACTCA&GTTC 
CAACCGGCCG 
AAQGCACCTA 
GQGGGACCCA 
TGAAGGAGC 
CCGAGACCTC 
ATAAAACTCC 
AATACACAGT 
AGGTGGCCTG 
CTGGAGCTTC 
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GCATTATACC 


TGGCCACATG 


e(AGAGAACAC 


540 


GGGGCAGAGC 


CTGACATCTSC 


TAACAAATCC 


600 


CGCAAGAACG 


CGGAGGGGGT 


GAGGATATTG 


660 


TGTAACAGGG 


GCTGGACCGC 


ACTGCACGAG 


720 


GAGATCCTAG 


TGAGTGGCGG 


GGCCAAGGTG 


780 


TTGTTTGTGG 


OTGCCCAGAG 


TGGGCAGCTG 


840 


GCAGACATCAy 


'ACACGCAGGC 


CAGTGACAGT 


900 


GAGCATGAAG 


ACGTGGTAGA 


GTTTCTTCTC 


960 


AAGGACGGCC 


TGCTCCCCCT 


GCATGTTGCC 


1020 


ATGCTOCTGC 


CTGTGACCAG 


CCGCACGCGC 


1080 


CTAC2CGGCCG 


AGCGCAACCA 


CGACGCGGTG 


1140 


GTCAACGCAC 


CTCTGGCTCC 


CGAGCGCGCC 


1200 


(CTCTACTTCG 


CTGTGGTCAA 


CAACAATGTG 


1260 


GCGGACCCCA 


ACCGCGATGT 


CATCAGCCCT 


1320 


CGCACCATGC 


AGCTGCTGTT 


GGACCATGGC 


1380 


CCCACCGCCT 


TTCCAGCCAC 


CATCATGTTT 


1440 


CTTATGGACC 


TCGGCTGCGA 


TGGCGAGCCC 


1500 


CACCACCCGC 


CCCGCGACCT 


GGCCGCTTCC 


1560 


GCGTGGTGCA 


GTTCTGTGAG 


TTCCTGTCGG 


1620 


TCATCGATGT 


CCTCCTGGAC 


TATGTGGGCA 


1680 


ACATCGACAG 


CTTTGAGGAC 


TGGGCTGTCA 


1740 


TGGCTCACCT 


CTGCCGGCTG 


CGGGTTCGGA 


1800 


TGGACACACT 


GCCGCTTCCC 


GGCAGGCTAA 


1860 


AACCAGCCTG 


GAGAGGAGAT 


GTGGCCTTCA 


1920 


CATCCAGGAC 


CCCCTGGGGT 


CAGAACAGGT 


1980 


ACCCAAAGTG 


AGAACCTGAT 


GTGGGGAGTG 


2040 
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GACGTGGAAC CTCTGCTTTC ACACTGTCAG CGGATCGCAG 
TAGCCAGAGA CCTTCAACCT GGGGCCAGGG GAGAGCTGGT 
AGGAATCCTG GCCTTAAGCT GGAGAACTTG TAGGAATCCC 
AGGCTGCGAG GGAGACGCCC AGCCCAAGTA TTTTATTTCC 
CAGAAAAAAA AAAAAACATG GGCGCAGCTT ATTCCTTAGT 
CGCTTAAAGC TACTGGAAAC ATGCGTTCCA CTATGCTTGA 
ACGAGAGCCG ACGTGCTTCA AGGTTGGATT TTTGGTTGCC 
TGTCCGACGT AATTGACCCC GTGTTTTGTC ACTTTCGAGT 
TTTGGTTGTC CCCAAAATTG TGGGTGGTGT GCGGACGCQ 
ATAATCATTA CTGGAGAATG TAGAGCGGCG GTTTTAC( 
TTCCCAAAA 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 base pairs^ 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



ACCCGCTCTGy 


/CTTCTGGCCA 


2100 


CTGGGCAAGG 


TGGCCCAGGC 


2160 


TCACTGGACC 


CTCAGCTTTC 


2220 


GTGACACAAT 


AACGTTGTAT 


2280 


AGGGTATTTA 


CTTGCATGCG 


2340 


GA^TCCCCTT 


GCACTGGTAA 


2400 


CpTTTGGCGT 


TCCGCGGGTT 


2460 


fGTTCCGACTA 


TTGGGGGGCT 


2520 


CGAGAAGTGG 


TTCATGGGCG 


2580 


TAAATATTTT 


TTAAGCCGCC 


2640 






2649 




(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SfeQ ID NO: 33 
CCTCCTGAGA GTTCGCCGGC CCGGGCCCAA TGGGTTGTTC 
CAGCAGCAGC TTGTTCAAGA CCTCCCAGCT GGCGCCTGCG 
CAAGGATGCG ATGAAGAGGC CTTGAA&ACC ATGATCAAGG 
CCCAACAAGG AGGGCTGGCT GCCGCffGCAC GAGGCCGCAT 

GTAOCCAGGG ACCATCGACC 
CAGGGGC CACCTGGACT 
CCAACAAA TCCCGAGAGA 
TGAAGATTC TTGGTGCAGC 



CTGAAAGTCC TGCAGCGAGC 
ACAGCCGTTT ACTTGGCAAC GT 
GCAGGGGCAG AGCGGGACAT C 
AGCGCAAGAA CGCGGAAGCC 



CAAGGGGTCA 
GACCCCTTGA 
AAGGGAAGAA 
ACTATGGCCA 
AGCGCACCCT 
GTCTCCTGTC 
ACCGCTCTAC 
ACAACGCAGA 



GCTGCAACCG GGCTG 



TGCAGAAATA 60 

TAAAGGCCAT 120 

TCTCGCAGAG 180 

GGTGGGCTGC 240 

GCAGGAGGAA 300 

ACTGCTCCAA 360 

AAAGCCTGTG 420 

CACCAACAAC 480 
495 



153 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

GTGCAGCTCT GCTCGCGGCT GAAGGAACAC ATCGACAGCT TTCjAGGACTG 

AAGGAGAAGG CAGAACCTCC AAGACCTCTG GCTCACCTTT GCCGACTGCG 

/ 

GCCATTGGGA AATACCGTAT AAAACTCCTA GACACCTTGC CGCTCCCAGG 



AGATACCTGA AATACGAGAA CACCCAGTAA CTGGGGCCAO' GGGGAGAGAG 



J, 



TCAGACTCTT CTTACTAAGT CTCAGGACGT CGGTGTTCGC AACTCCAAGG 
ACAGACGAGG CTGCAGGCTG CCTCCCTCTC AGCCTGGACA GCTACCAGGA 
TCTCAGGGCC CAGAGCTTTG GCCAGAGCAG AGAACAGAAT GTGTCAAGGA 
TTGTTTACAA ACTGATGAGC AGATCCCAGA CCTTCTCTAC CTTCAGGAAT 
TCTATTCCTG GGGCCAGGGC AGAGCTTGAG GTGTTCTGGG GAAGGTGGTG 
TCCCTGTGCC CCTCCACTTG TTCTGGAAAA CT^ACCACTT GACTTCAGAG 
AAAGACTAAG ATGAAGACGT GGCCCAAGGT AGGGGGTAGG GGGAGCCTGG 
GCTTTGTTAA GTATTAATAT AATAAATGTT ACACATGTGA AAAAAAAAA 
(2) INFORMATION FOR SEQ ID NO:35/ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 848 base/pairs 

(B) TYPE: nucleic acad 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CD'S 

(B) LOCATION: l/.624 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



GGCCGTCATC 
GGTTCGAAAG 
CAGGCTGATT 
GAGTAGCCCC 
GGACCTGGTG 
TCTCACTGGG 
GAAGAATCAT 
GGCAGAAACC 
CTCAGAGCCT 
CTTTCTCTCC 
GTCTTGGAGG 
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TTG GAG AAG TGT GGT TGG TAT TGG GGG CCA ATG AAT TGG GAA GAT GCA 
Leu Glu Lys Cys Gly Trp Tyr Trp Gly Pro Met Asn Trp Glu Asp Ala 
1 5 10 / 15 

GAG ATG AAG CTG AAA GGG AAA CCA GAT GGT TCT TTC CTG GTA CGA GAC 
Glu Met Lys Leu Lys Gly Lys Pro Asp Gly Ser Phe Leu Val Arg Asp 
20 25 / 30 

AGT TCT GAT CCT CGT TAC ATC CTG AGC CTC AGT TTC/CGA TCA CAG GGT 
Ser Ser Asp Pro Arg Tyr lie Leu Ser Leu Ser Pb 
35 40 



/ 



Arg Ser Gin Gly 
45 



ATC ACC CAC CAC ACT AGA ATG GAG CAC TAC AGA GGA ACC TTC AGC CTG 
lie Thr His His Thr Arg Met Glu His Tyr Arg /Gly Thr Phe Ser Leu 
50 55 / 60 

TGG TGT CAT CCC AAG TTT GAG GAC CGC TGT CAft TCT GTT GTA GAG TTT 
Trp Cys His Pro Lys Phe Glu Asp Arg Cys G/n Ser Val Val Glu Phe 
65 70 /75 80 

ATT AAG AGA GCC ATT ATG CAC TCC AAG AAt/gGA AAG TTT CTC TAT TTC 
lie Lys Arg Ala lie Met His Ser Lys Asn Gly Lys Phe Leu Tyr Phe 

85 9*0 95 



48 



96 



144 



192 



240 



288 



TTA AGA TCC AGG GTT CCA GGA CTG CCA 
Leu Arg Ser Arg Val Pro Gly Leu Pro 
100 105/ 



SCA ACT CCT GTC CAG CTG CTC 
Pro Thr Pro Val Gin Leu Leu 
110 



336 



TAT CCA GTG TCC CGA TTC AGC AAT GJC AAA TCC CTC CAG CAC CTT TGC 
Tyr Pro Val Ser Arg Phe Ser Asn Val Lys Ser Leu Gin His Leu Cys 
115 120 / 125 



384 



AGA TTC CGG ATA CGA CAG CTC GTC/AGG ATA GAT CAC ATC CCA GAT CTC 
Arg Phe Arg lie Arg Gin Leu Val' Arg lie Asp His lie Pro Asp Leu 
130 135 / 140 



432 



CCA CTG CCT AAA CCT CTG ATC 
Pro Leu Pro Lys Pro Leu lie 
145 150 



^CT TAT ATC CGA AAG TTC TAC TAC TAT 
>er Tyr lie Arg Lys Phe Tyr Tyr Tyr 
155 160 



480 



GAT CCT CAG GAA GAG GTA TAC CTG TCT CTA AAG GAA GCG CAG CGT CAG 
Asp Pro Gin Glu Glu Val Tyr Leu Ser Leu Lys Glu Ala Gin Arg Gin 

165 / 170 175 



528 



TTT CCA AAC AGA AGC AAG ^AGG TGG AAC CCT CCA CGT AGC GAG GGG CTC 
Phe Pro Asn Arg Ser Lys/ Arg Trp Asn Pro Pro Arg Ser Glu Gly Leu 
180 / 185 190 



576 



CCT GCT GGT CAC CAC C#A GGG CAT TTG GTT GCC AAG CTC CAG CTT TGA 
Pro Ala Gly His His Gin Gly His Leu Val Ala Lys Leu Gin Leu 
195 / 200 205 



624 
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AGAACCAAAT TAAGCTACCA TGAAAAGAAG AGGAAAAGTG AGGGAACAjSG AAGGTTGGGA 
TTCTCTGTGC AGAGACTTTG GTTCCCCACG CAAGCCCTGG GGCTTGQAAG AAGCACATGA 
CCGTACTCTG CGTGGGGCTC CACCTCACAC CCACCCCTGG GCATc/tAGG ACTGGAGGGG 
CTCCTTGGAA AACTGGAAGA AGTCTCAACA CTGTTTCTTT TTCi 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID flfO:36: 

Leu Glu Lys Cys Gly Trp Tyr Trp Gly Pro/ Met Asn Trp Glu Asp Ala 
15 10 15 

Glu Met Lys Leu Lys Gly Lys Pro Asp (Ay Ser Phe Leu Val Arg Asp 
20 25 / 30 

Ser Ser Asp Pro Arg Tyr lie Leu Ser/ Leu Ser Phe Arg Ser Gin Gly 
35 40 / 45 

He Thr His His Thr Arg Met Glu h4s Tyr Arg Gly Thr Phe Ser Leu 
50 55 / 60 

Trp Cys His Pro Lys Phe Glu Asp/ Arg Cys Gin Ser Val Val Glu Phe 
65 70 / 75 80 

He Lys Arg Ala He Met His S^r Lys Asn Gly Lys Phe Leu Tyr Phe 

85 / 90 95 

Leu Arg Ser Arg Val Pro Gly/Leu Pro Pro Thr Pro Val Gin Leu Leu 
100 / 105 HO 

Tyr Pro Val Ser Arg Phe Sefr Asn val Lys Ser Leu Gin His Leu Cys 
115 / 120 125 

Arg Phe Arg He Arg Gin Leu Val Arg He Asp His He Pro Asp Leu 
130 /135 140 

Pro Leu Pro Lys Pro Led He Ser Tyr He Arg Lys Phe Tyr Tyr Tyr 
145 15J0 155 160 

Asp Pro Gin Glu Glu Vial Tyr Leu Ser Leu Lys Glu Ala Gin Arg Gin 

165 / 170 175 



684 
744 
804 
848 
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Phe Pro Asn Arg Ser Lys Arg Trp Asn Pro Pro Arg Ser Glu/Gly Leu 
180 185 191 

Pro Ala Gly His His Gin Gly His Leu Val Ala Lys Leu jSln Leu 
195 200 205> 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO/37 
GTTCCAAGCC TAACCCATCT TTGTCGTTTG GAAATTCGGG 
CTTCACTCTG ACATTTTCAT CCATCAGTTG CCACTTCCCA 
CTCTATGAAG AGGTTTTAAG AATGAATGAG ATT£TAGAAC 
GAAACCAGCA AGGCCACCTG ACACAGGTCC TtfTAATTCTG 
TGTGTGACTG TTTGGATTTG GTGATCAAAT JGTCCATGTTT 
GTGTCTTTCC CAATATTGTG AACCTTATCQT ATCTTGCCTT 
CACTTTGTTG TGTATTATTT GTTTACCTjSA CCATTTTCTA 
GTAATTCTGA AAAAAAAAAA AAAAAAA&AA AAAAAAAAAA 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 747 /base pairs 

(B) TYPE: nucleac acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: /linear 

(ii) MOLECULE TYPE/ DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GGGGATCGAA AGCGGK*3G£T TCTGGGACGC AGCTCTGGAG 
TTTCGGTGTA GAAGTGGCAG CACGGCAGAC TGGTCAAACA 
ACGCGGACAC GTGCTGTACA GTTGGACTTG CTGCCAGGGA 



CCAGTCTAAA 
GAAGTCTGCA 
CAGCAGCTAA 
TTTAGTCACA 
ACAGTTGCTT 
ACTCAGTTTT 
CTTTATTCTG 
AAAA 



AGCAGAGCAC 
GAACTATTTG 
TCAGGATGGA 
AAAGACGGCT 
TTCCCAGTTT 
ATTTCTAGTG 
CTAATAAACT 



ACGCGGCCTC GGACCAGCCA 
AATGGATTTT ACAGAGGCTT 
AGGCAATGTT AAAGTCTTAA 



60 
120 
180 
240 
300 
360 
420 
464 



60 
120 
180 
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GGAAACTGCT CAAAAAGGGC CGAAGTGTCG ATGTTGCTGA 
TTCATGAAGC AGCTTATCAC AACTCTGTAG AATGTTTGCA 
CATCTGAAAA CTACATTAAG ATGAAGACCT TTGAAGGTTT 
CAAGTCAAGG ACATTGGAAA ATCGTACAGA TTCTTTTAGA 
CAACTACTTT AGAAGAAACG ACACCATTGT TTTTAGCTGT 
TGTTAAGGCT GTTGCTTCAA CACGGAGCAA ATGTTAATC 
GGAACTCCTT GCACCAGGCT TCTTTTCAGG AAAATGCT7GA 
GAAAAGGAGC AAACAAGGAA TGCCAGGATG ACTTTGfSAAT 
CTCAGTATGG CCAAGCTAGA AAGCTTTGAA GCAT^CTTAT 
AATTGTCAAG CCTTGGACAA AGCTACC 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS/! 

(A) LENGTH: 1018 base £airs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear/ 



TAACAGGGGA 
AATGTTAATT 
CTGTGCTTTG 
CTGGGGCA 
'GAAAATGGA 
ATCCCATTCT 
GATCATAAAA 
CACACCTTTA 
TTCATCCGGG 



TGGATGCCAA 
AATGCAGATT 
CATCTCGCTG 
GATCCTAATG 
CAGATAGATG 
ATGTGTGGAT 
TTGCTTCTTA 
TTTGTGGCTG 
TGCAAATGTC 



(ii) MOLECULE TYPE: DNA 



/ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
CACAAATGGG ACCATACAAA AATOTTGGAC TTGTTAATAA 
GTGACACTGG GCTAAACAAA GtAgTCCCT GTTTACTCAG 
GGATTGCCTA GAAATATTAC TGCGGAATGG TCTACAGCCC 
TTTTTGGATT CAGTTCTCCT GTGTGCATGG CTTTCCAAAA 
TTGGAATTGT GAACATTCTy TTGAAATATG GAGCCCAGAT 
ACTGCCTGAA GTACGAGAAE TTTTCGATAT TTCGCTACTT 
TGGGACCATG GAACCATATA TATGAATTTG TAAATCATGC 
ATAAGGAGTG GTTGCCACAT CTTCTGGTTG CTGGATTTGA 
ATTCTTGGAT TGACTCAGTC AGCATTGACA CCCTTATCTT 
GGAAGACACT TGCACCAGCT GTTGAAAGGA TGCTCTCTGC 



CCACTTACTA 
CAGTGTTTGG 
AGACGCCCAG 
GGAGGTGGAG 
AAATGAACTT 
TTTGAGGAAA 
AATTAAAGCA 
CCCACTGATT 
CACTTTGGAG 
TCGTGCCTCA 



ACCGGGACCT 
GGGACATGAA 
GCGTGCCTTG 
CTGTAGTTCT 
CATTTGGCAT 
GGTTGCTCAT 
CAAGCAAAAT 
CTACTGTGCA 
TTTACTAATT 
AACGCTTGGA 



240 
300 
360 
420 
480 
540 
600 
660 
720 
747 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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TTCTACAGCA ACATATTGCC CACTGTTCCA TCCCTGACCC 
CGGTCCAGTC TAAAATCAGA ACGTCTACGG TCTGACAGTT 
CCCAGAAGCC TACATAATTA TTTGCTCTAT GAAGACGTTC 
GAACTGGCAG CTATTCAAGA TGGATAAATC AGTGAAACTA 
TCTCTGAAAA ATCATCGAGA CAAAAGAGCC ACAGAGTAQ 
CAAAAGATGA TTATTGATTG TCAGATAGGT TAGGTTT'] 
AATGTTTATG TTTACAACTA GCCTTCCCAG T2 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1897 base pa^rs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : sing] 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION* SEQ ID NO: 40 
CGGGGGGCTG GGACCTGGGG CGTAMXZGTC TCTACCACGA 
AAACATACCC AGCCTTTCTG GAGGCGGACG AGACATTCAT 
TGGCCCTGGA CATGRATGAT GGGACCTTAA GTTTCATCGT 
TGGCTTTCCG GGGACTCAAG GGTAAAAAGC TGTATCCTGT 
ACTGTGAGAT CCGCATGCGC YTACTTGAACG GACTTGATCC 
ACCTGTGCCG GCGTTCGGTjG CGCCTAGCGC TGGGAAAAGA 
CTCTGCCGCT ACCTGCCtfCC CTCAAAGCCT ACCTCCTCTA 
ACCGCCATAC GACAGC0ATC TGGTGCCAAR TCACTGAGCC 
TGCGCCTGGG ATGGAAGCCC ACCTCAGCCA TGGGCAGACG 
TGCCTCTGCT GGGGGAACCT ATGCCAACGG ACTTCTCCCT 
GCAGCACCCA GGGTCCTTCCC TGAACCAGAT GCAGAGAATA 
GGCGCCTTCT GJCTCTCAGGT GGAGTGGGCT GCCCCCCACT 
CCCACCTGGG /5GGTCCTGGG AGGTAAGACT AGTAGGAGGT 




ATCTTTG 
ATATTMTCA 
TGAGGATGTA 
fTTAACACAG 
AGTTTTTATG 
GGGGCCAGTA 
AAAAAAAAAA 



TTTGGAAATT 
GCTGCCACTT 
TGAAGTTCCA 
CTAATTTTTT 
ATTTTATAGT 
GTTCAGTGAG 
AAAAAAAA 



CGGC AAGAAC 
TGTCCCTGAC 
GGATGGACAG 
AGTGAGTGCC 
TGAGCCCCTG 
GCGCCTGGGT 
CCAGTGATCC 
CGTTGGGGTC 
TGCCCCCTCA 
TCCCAACACT 
AACTATGAAA 
CTCTGCAGAG 
GCCAGGGCTG 



CAGCCAAGTA 
TCCTTTTTCG 
TACATGGGAG 
GTCTGGGGCC 
CCACTCATGG 
GCCATCCCCG 
ACATCCCAGG 
CGCCGACCCC 
TCCTACCGGC 
GGCTGAAGCA 
ACCTCTCTCA 
AGAGGCTACA 
ARTCCAAAAG 



660 
720 
780 
840 
900 
960 
1018 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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CAGGAATGGC CAGGAMCAGG CCATACAGAT GAAGCTCAGG ATGTCACATA CCATGGACAM 
TGAGACAGAA CCCCAGGTTG GAMTTCCCTT GGGCCAACGA GT|GCCAGCTT TAATGTCAGC 
TGCMGGTGCT CTGTGGCCTG TATTTATTCT TTAAACAGTA CSCAAAGGCCA TTTATTTATT 
CCACTTAGAA AGGAAACCTT GGTGGGTGGY TTCCCTCGAy GTGCTTTCCC CCACCTCCCT 
GGAATGTGTG TGCCACACCT GTCCTTGTCC CAGGCCAGjSA CTGTGGCACA TGAGCTGGTG 
TGCACAGATA CACGTATGTC GTCGTGCATG ACCCCT^ACT AGTTCCTAAG TAGCCCTGCA 
CCAAGCACCA GAGCAGACCC CAAGAGAGGC CCGTGCAAGT CCCCATGTCC CCAGGTCCCT 
GCTTCTGTTG CCTTGGGACT CATACACCGG CAC^CGTGTT TCAGCCTCTT GACTTCCATG 
AGCTTCGAAT TTTGCCCCCG ATTCTTCTGA TA5TTTCCCAT TGGCATCCTC CAAAGCTCTG 
GGCCTGGAGG GCATTAGGAC ACATGGAATG M3TGGGGTCT CCAGCCCCTG GGAAAGCCAC 
TGGCAAGGCA GGATTAGAAA GACCAAGAGc/ AGGGTGGGGC GCCATGAAGC CTGTATGCCT 
CTCAGGCTCA AGACCCCGCC ACACACCCAX: TCAAGCCTCA GAAGTGGTGT GTAGGGCAGC 
CCCAGGAGAG GAATGCCTGT CCTAGCAGCA CGTACATGGA GCACCCCACA TGTGCTCCAG 
CCCTCTGGCT GTTTCTCTTG CTCTAGAATC AACTCCCTAC ATTGGGAATG TAGCCATTTG 
GTAGAGGACT TGCCTAGCCT GCAGGAAGCT CACGTTCCAT CCCCTGCACC AAGGAGAATC 



AAAGCTCAGG AGGCTGAGGC AGGAGGATTG CTGTCAGTGG TGTACAGAGG TCATGGCCAT 

/ 

CCTGGGCTAT ATTAAACCTT GTCCTTTAAG AAAAAGAAAA GAAATCAACT TCCATTGAAT 
CTGAGTTCTG CTCATTTCTG CACAGGTACA ATAGATGACT TKATTTGTTG AAAAATGKTT 
AATATATTTA CMTATATATA' TATTTGTAAG AAGCATT 
(2) INFORMATION FOR/^EQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 134 amino acids 

(B) TYKE: amino acid 

(C) STRANDEDNESS : single 

(D) TOCOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Gly Gl\f Trp Asp Leu Gly Arg Asn Arg Leu Tyr His Asp Gly Lys Asn 
1/5 10 15 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1897 



Gin 
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Pro Ser Lys Thr Tyr Pro Ala Phe Leu Glu Pro 
20 25 



Asp' Glu Thr Phe 
30 



He Val Pro Asp Ser Phe Phe Val Ala Leu Asp/Met Xaa Asp Gly Thr 
35 40 / 45 

Leu Ser Phe He Val Asp Gly Gin Tyr Met sly Val Ala Phe Arg Gly 
50 55 / 60 



Leu Lys Gly Lys Lys Leu Tyr Pro Val 
65 70 



Ser Ala Val Trp Gly His 
75 80 



Cys Glu He Arg Met Arg Tyr Leu Asn/ Gly Leu Asp Pro Glu Pro Leu 

85 / 90 95 

Pro Leu Met Asp Leu Cys Arg Arg Ser Val Arg Leu Ala Leu Gly Lys 
100 A05 110 

Glu Arg Leu Gly Ala He Pro ALa Leu Pro Leu Pro Ala Ser Leu Lys 
115 120 125 

Ala Tyr Leu Leu Tyr Gin 
130 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 265 base pairs 

(B) TYPE: nucleic/acid 

(C) STRANDEDNESS/ single 

(D) TOPOLOGY: li/iear 

(ii) MOLECULE TYPE: 0NA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42; 

AAGGGTAAAA AACTGTATCC /TGTAGTGAGT GCCGTCTGGG GCCACTGTAG ATCCGAATGC 

GCTACTTGAA CGGACTCGAff CCCGAGACTG CCGCTCATGG ATTTGTGCCG TCGCTCGGTG 

CGCCTGGCCC TGGGGAGGGA GCGCCTGGGG GAGAACCACA CCTGCCGCTG CCGGCTTCCC 

TCAAGGCCTA CCTCCT0TAC CAGTGACGTT CGCCATCATA CCGCCAGCGC GACAGCCACC 

TGGTGCCAAC TCACTGAGCC GCCTG 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A/ LENGTH: 2438 base pairs 
(/) TYPE: nucleic acid 
:) STRANDEDNESS: single 



60 
120 
180 
240 
265 
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(D) TOPOLOGY: linear / 

(ii) MOLECULE TYPE: DNA / 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: / 

AAGTGGCGGC GGTCCCTGGA GAGCAGGCGG AGGCAGCGGC ^AGTCTGACT CTGGGCTGAC 60 

CGTGGAGCCG GGGCGGGGGC TGACAGCCAG GCCTCCGCCt/ GGCGGGAGCC GCACGAGGAG 120 

CGGGAGTGGC CGGGCCTCTC TTCCGCGCTT GAGCGAGOGC CGGGTGATGG CGGTGGTGAT 180 

GGCGGCAGGC GCTCGGACAG CTCCGCTTGA GCTGAGOTCG GAGAGATCCG TCCAGAAAGT 240 

GCCCAGAAGA AACTTCCTCT TAGAAAAGCT GAAA3&CACA RTATTTATAA CACTGGAAAT 300 

TGTAAAGAAT TTGTTTAAAA TGGCTGAAAA CAA/TAGTAAA AATGTAGATG TACGGCCTAA 360 

AACAAGTCGG AGTCGAAGTG CTGACAGGAA GfSATGGTTAT GTGTGGAGTG GAAAGAAGTT 420 

GTCTTGGTCC AAAAAGAGTG AGAGTTGTTC y/TGAATCTGAA GCCATAGGTA CTGTTGAGAA 480 

TGTTGAAATT CCTCTAAGAA GCCAAGAAAG GCAGCTTAGC TGTTCGTCCA TTGAGTTGGA 540 

CTTAGATCAT TCCTGTGGGC ATAGATTJTT AGGCCGATCC CTTAAACAGA AACTGCAAGA 600 

TGCGGTGGGG CAGTGTTTTC CAATAAAGAA TTGTAGTGGC CGACACTCTC CAGGGCTTCC 660 

ATCTAAAAGA AAGATTCATA TCAGTTCAACT CATGTTAGAT AAGTGCCCTT TCCCACCTCG 720 

CTCAGATTTA GCCTTTAGGT GGCATTTTAT TAAACGACAC ACTGTTCCTA TGAGTCCCAA 780 

CTCAGATGAA TGGGTGAGTG CAGACCTGTC TGAGAGGAAA CTGAGAGATG CTCAGCTGAA 840 

ACGAAGAAAC ACAGAAGATG &CATACCCTG TTTCTCACAT ACCAATGGCC AGCCTTGTGT 900 

CATAACTGCC AACAGTGCTt/cGTGTACAGG TGGTCACATA ACTGGTTCTA TGATGAACTT 960 

GGTCACAAAC AACAGCATAG AAGACAGTGA CATGGATTCA GAGGATGAAA TTATAACGCT 1020 

GTGCACAAGC TCCAGAAAAA GGAATAAGCC CAGGTGGGAA ATGGAAGAGG AGATCCTGCA 1080 

GTTGGAGGCA CCTCCtAgT TCCACACCCA GATCGACTAC GTCCACTGCC TTGTTCCAGA 1140 

CCTCCTTCAG ATCAG2TAACA ATCCGTGCTA CTGGGGTGTC ATGGACAAAT ATGCAGCCGA 1200 

AGCTCTGCTG GAAGGAAAGC CAGAGGGCAC CTTTTTACTT CGAGATTCAG CGCAGGAAGA 1260 

TTATTTATTC TCTCTTAGTT TTAGACGCTA CAGTCGTTCT CTTCATGCTA GAATTGAGCA 1320 

GTGGAATCAT AACTTTAGCT TTGATGCCCA TGATCCTTGT GTCTTCCATT CTCCTGATAT 1380 




- 162 - 

TACTGGGCTC CTGGAACACT ATAAGGACCC CAGTGCCTGT ATGTTCTTT?G AGCCGCTCTT 
GTCCACTCCC TTAATCCGGA CGTTCCCCTT TTCCTTGCAG CATATTT&CA GAACGGTTAT 
TTGTAATTGT ACGACTTACG ATGGCATCGA TGCCCTTCCC ATTCGTTCGC CTATGAAATT 
GTATCTGAAG GAATACCATT ATAAATCAAA AGTTAGGTTA CTGAGGATTG ATGTGCCAGA 
GCAGCAGTGA TGCGGAGAGG TTAGAATGTC GACCTGCATA CATATTTTCA TTTAATATTT 
TATTTTTCTT ATGCCTCTTT GAATTTTTGT ACAAAGGCAg/tTGAATCAAA TAAAACTGTG 
CCCTAAGTTT TAATTCCAGA TCAATTTATT TTTTTTATOA TACACTTGTT ATATATTTTT 
AAGCAGGTGT TTGGTTTTGT TTTTACCATA TAAATT^ACA TATGGTCCAG GCATATTTAC 
AATTTCAAGG CATTGCATAT ACATTTGAAT ATTCTGTATT TTTTAAATAA TCTTTTGTTC 
TTTCCTATGT GTGAAATATT TTGCTAATCT ATGCTATCAG TATTCTTGTA TGACCGAATA 
GTTACCTATT CTCTTTTCAT CTTGAAGATT TfTCAGTAAAG AGTGTTGTAA TCAATCCATT 
ATAATGTAAT TGACTTTTGT AATTTGCCAA/TAGGAGTGTT AAACAACAAA ATGATTTAAA 
ATGAAACTTA ATGTATTTTC ATTTTAAAT& TTAACTAAAC CAAGTTTGTT TGTTAGTTAT 
TCTAGCCAAT AAGAAAAGAG AATGTAGCAT CCTAGAGGTG TATTTGTTCT GCAGTTTGGC 
AGGACCGTCA GTTAGTCCAA ATAAAGATCC CCTCAGCGTG GAGGCGAATG GAACCTGTGC 



:a^ag 



TCCTTTCTTA CGGGAAGCTT TGCAAAGCAA AATAGCAGGG TTACAAGCTT GGAGTTGTTA 
AGGCAACTAG AGTTTTCTCT ATT^ATTTAT AGACTGTTGT TGCACCTACT TAGCTCTTTT 
TTGGGAACTC TAGTTCCCAG GSGAAAATAC CTCGTGCC 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH/: 542 amino acids 

(B) TYPE : / amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE/ TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Ser Gly Gly Gly Pro Trp Arg Ala Gly Gly Gly Ser Gly Lys Ser Asp 
1 / 5 10 15 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2438 
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Ser Gly Leu Thr Val Glu Pro Gly Arg Gly Leu /Thr 
20 25 



Ala Arg Pro Pro 
30 



Pro Gly Gly Ser Arg Thr Arg Ser Gly Ser Gly Arg 
35 40 



Ala Ser Leu Pro 
45 



Arg Leu Ser Glu Arg Arg Val Met Ala Val Val Met Ala Ala Gly Ala 
50 55 / 60 



Arg Thr Ala Pro Leu Glu Leu Ser Ser/ Glu Arg Ser 
65 70 / 75 



Pro Arg Arg Asn Phe Leu Leu Glu Eys Leu Lys Asn 

85 / 90 



Val Gin Lys Val 

80 

Thr Xaa Phe lie 
95 



Thr Leu Glu lie Val Lys Asn Leu Phe Lys Met Ala 
100 / 105 



Glu Asn Asn Ser 
110 



Lys Asn Val Asp Val Arg Pro/Lys Thr Ser Arg Ser 
115 / 120 



Arg Ser Ala Asp 
125 



Arg Lys Asp Gly Tyr Val Trp Ser Gly Lys Lys Leu Ser Trp Ser Lys 

130 ,435 140 

Thr Val Glu Asn 
160 



Lys Ser Glu Ser Cys Ser Glu Ser Glu Ala lie Gly 
145 154) 155 

/ 

Val Glu lie Pro Leu Arg Ser Gin Glu Arg Gin Leu 

165/ 170 



lie Glu Leu Asp Leu Asp His Ser Cys Gly His Arg 
180 / 185 



Ser Cys Ser Ser 
175 

Phe Leu Gly Arg 
190 



Ser Leu Lys Gin /Lys Leu Gin Asp Ala Val Gly Gin 
195 / 200 



Cys Phe Pro lie 
205 



Lys Asn Cys Ser Gly Arg His Ser Pro Gly Leu Pro Ser Lys Arg Lys 
210 / 215 220 



lie His lie /Ser Glu Leu Met Leu Asp Lys Cys Pro 
225 / 230 235 



Phe Pro Pro Arg 
240 



Ser Asp Leu Ala Phe Arg Trp His Phe lie Lys Arg 
/ 245 250 



His Thr Val Pro 
255 



Met Ser /Pro Asn Ser Asp Glu Trp val Ser Ala Asp 
260 265 



Leu Ser Glu Arg 
270 



Lys Leu Arg Asp Ala Gin Leu Lys Arg Arg Asn Thr 
275 280 



Glu Asp Asp lie 
285 



- 164 - 

Pro Cys Phe Ser His Thr Asn Gly Gin Pro Cys Va4 He Thr Ala Asn 
290 295 300 

Ser Ala Ser Cys Thr Gly Gly His He Thr Gly/Ser Met Met Asn Leu 
305 310 3L5 320 

/ 

Val Thr Asn Asn Ser He Glu Asp Ser Asp Met Asp Ser Glu Asp Glu 

325 33^/ 335 

He He Thr Leu Cys Thr Ser Ser Arg Lys Arg Asn Lys Pro Arg Trp 
340 345 / 350 

Glu Met Glu Glu Glu He Leu Gin Lei/ Glu Ala Pro Pro Lys Phe His 
355 360 / 365 

Thr Gin He Asp Tyr Val His Cys Leu Val Pro Asp Leu Leu Gin He 
370 375 / 380 

Ser Asn Asn Pro Cys Tyr Trp Gl/y Val Met Asp Lys Tyr Ala Ala Glu 
385 390 / 395 400 

Ala Leu Leu Glu Gly Lys Pro/Glu Gly Thr Phe Leu Leu Arg Asp Ser 

405 / 410 415 

Ala Gin Glu Asp Tyr Leu Phe Ser Val Ser Phe Arg Arg Tyr Ser Arg 
420 / 425 430 

Ser Leu His Ala Arg lief Glu Gin Trp Asn His Asn Phe Ser Phe Asp 
435 / 440 445 



Ala His Asp Pro Cys 
450 



fal Phe His Ser Pro Asp He Thr Gly Leu Leu 
455 460 



Glu His Tyr Lys Asp Pro Ser Ala Cys Met Phe Phe Glu Pro Leu Leu 
465 / 470 475 480 

Ser Thr Pro Leu Jle Arg Thr Phe Pro Phe Ser Leu Gin His He Cys 

185 490 495 

Arg Thr Val lie' Cys Asn Cys Thr Thr Tyr Asp Gly He Asp Ala Leu 
5W 505 510 

Pro He Pro Ser Pro Met Lys Leu Tyr Leu Lys Glu Tyr His Tyr Lys 
515 / 520 525 

Ser Lys Val/ Arg Leu Leu Arg He Asp Val Pro Glu Gin Gin 
530 / 535 540 



(2) INFORMATION FOR SEQ ID NO: 45: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 5000 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 
CCCTCTGGGC AAGCCGCCCC CCCCCCACCC ATCTACCAjtA 
CACACATTCA GACCTTGGGG CAAAAACAAA GCAAAATAAC 
TGGAAAGTCC TTACTTCAGG AAGGTTGGCA GATGAjSGAGC 
CTGCCACAAA GGAGTCTTTT TTTTTAATGG TTTTTCAAGA 
CTGGCTGTCC TGGAGCTCAC TTTGTAGACC AGiSCTGGCCT 
CCTCTGCCTC CTGAGTGCTG GGATTAAAGG OGTGCAGCAC 
TCAATTAAGG TTCGTTCCTT TCAGATAACt/ CTAGGTTCTG 
ACACAGCACA GTTTGTATGC CACATTCAGT TCAGAAGACA 
AACTTATGCA CATTTGTGAG CTTCCAOTTG GGAGTGGGAA 
AGAGCAGCCG TGCTCTTAAC TGCTGM3CCA TTTCAGCAGC 
GAAATTAGCC GGGTATGAAT CATAECCTTA GAATCCTAGC 
GAAACAGGGA TTCAAGACCA GCTCTTGGCT ACAGAGCCCG 
CAAGAGACTA TTTCAAAGCC ATCCAAACAA CAATAACTAC 
TAGGCTGGGC ACAGGGTACA CACCTTTAAT GCCAACACTC 
TCAGTGTGAG TTTGAGTTCA* ACGTGGTCTA CATAGGGAGT 
AGTCTCTCTC TCTCTCTCTC TCTCTCTCTC TCTCTCACAC 
ACACACACAC ACACACACGG TGGCATTATG GGATTTTTTT 
AGCCCTGGCA TAGATTCACT CTGTAGACTA GGCTAGCCTT 
CTCTGCCTCC CAAGTGCTGG GATTATAGGT GTTGCACCAC 
ATTTTTGAAC TGT/TATCAAG AGGCTTTCGA GGAGGTCAAA 
TGATAATGTA GCTAATGATC AAACGACACT CAAAACTTAA 
CAGACAGCGT jGCCCACTCGT AGTTCCATTA CTCAGGAGGC 



CACACACACA CACACACACA 60 

AACAACAAAA ACACTGCCTG 120 

AAGGGAACAT TTTATCAGGA 180 

CAGGGTTTCT CTGTATAGCC 240 

CGAACTCAGA AATTCGCCTG 300 

CATGTCCAAC TGGCATTTTC 360 

GGTCAAGCTG ACACAAGGCT 420 

CCCAACCTCC CTGGAACTGG 480 

CCTGAACTGG GTCCTCTGCA 540 

CTCACATCAG AATTAAGTTA 600 

ATCTGAAAGC AGAGCTAAGA 660 

TCCTGTCCTA GGATGGGCTA 720 

AACAACAACA AGGTTAAAAT 780 

AGGAGGCAGA GGCAGGCTGA 840 

TCTAGGCCAG CAGAGGTTAC 900 

ACACACACAC ACACACACAC 960 

GGGATAAGGT TTCTCTGTCT 1020 

GAACTCAGAG ATCCGCCTGC 1080 

CACTGCCCAG CCACTTTGGG 1140 

CTTCAACAGC AACCTCTCCA 1200 

CCCTTAAAGC ACACATCCAC 1260 

TGAAGCAGGA GGATGAAGGA 1320 
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CTAAGGCTTC AGCAACCTAG GGAGCCGCAG GGGACAGTAG TCTCAATJZCC TACATTCTCC 1380 

TGAACACAGG AGCAGGAGTT CAGGAAGGGT GTCAAGGCCG CTTACTGATC TTAGGGCCTC 1440 

AGGAATGACT AGCTCAGGCA GAGAGAGCAA AGGTCTCCAG TGGAGAAGTC TACACACACA 1500 

CACACACACA CACACACACA CACACACACA CAGAATCCAA QGCGATGACG TCATCAAAGG 1560 

GTTAATTCTA GTCTGGGATG GGGGGGAGGG TGGGGCACGC/AGCTGTCAGG TGGCTTTGGA 1620 

AAAATAAACT GCTGAAGAGT CTGACGCCAG GGAGTCCT^G GAGGGACAAG AGGTTACCCA 1680 

CTCAAAGAGT GTGCTCCACA AAGCATGCGC GCTTGT^CAC GTCTGGAGTC GTCACTTATT 1740 

TTTTGCCTGG ATTCTTTGTA GCCGGTGGGT TCTCA&GGCG GTAAGTGGTG TGGCCGCCGT 1800 

GGTCTGGGAG GTGACGATAG GGTTAATCGT CCACAGAGCC CAGGGGCGGA GCGCGGGCGG 1860 

GCGTCCGCAG CCCCGCTGGA GCCGGAAGCA GTGGCTGGTC AGGGGCGCTT CTAGCCTTCC 1920 

CTATCTGTAC TTCCACAGAG GTCTCTGCGA/GCTAGGGGGA CAGTGAGGTG CGGGGTAGGG 1980 

GCCCGGCGTT AGAGCCAGCA AGGGGACGGT TCACGGTAAG GTCTGAGGGA GAGAGAGCTC 2040 

CTGAGAAACT TGGGGGGCGC GACACAGATA GGGTGAAAGC AGAGTGATAG ACCTGGGATG 2100 

GTTAGGGGAC CAAGGGAAGA CCAGGC^TGGT TGGCATACAC CGGTGAACGG ATGGGAGTCC 2160 

TAGGGAAAGA TGATGCGCCT AACAGTCCTT TCTGTCTCCA CACCACTCCA GGGGACGATC 2220 

CGGAGCTCAA CTTTCAAAAG CGAGACGCCC CAGCAAGCCT GTTTTGAGAA GTTCTTCAGC 2280 

GGCTCTCCTC ATGGGCCAGA CGGCCCTGGC AAGGGGCAGC AGCAGCACCC CTACCTCGCA 2340 

GGCTCTGTAC TCGGACTTCt/cTCCTCCCGA GGGCTTGGAG GAGCTCCTGT CTGCTCCCCC 2400 

/ 

TCCTGACCTG GTTGCCCAAC GGCACCACGG CTGGAACCCC AAGGATTGCT CCGAGAACAT 2460 

/ 

CGATGTCAAG GAAGGGGGTC TGTGCTTTGA GCGGCGCCCT GTGGCCCAGA GCACTGATGG 2520 

AGTCCGGGGG AAACGGGGCT ATTCGAGAGG TCTGCACGCC TGGGAGATCA GCTGGCCCCT 2580 

GGAGCAAAGG GGCASACACG CCGTGGTGGG CGTGGCCACC GCCCTCGCCC CGCTGCAGGC 2640 

TGACCACTAT GCGGCGCTTT TGGGCAGCAA CAGCGAGTCC TGGGGCTGGG ATATTGGGCG 2700 
/ 

GGGAAAATTG TATCATCAGA GTAAGGGCCT CGAGGCCCCC CAGTATCCAG CTGGACCTCA 2760 

GGGTGAGCAG GTAGTGGTGC CAGAGAGACT GCTGGTGGTT CTGGACATGG AGGAGGGGAC 2820 

TCTTGGCTAC/TCTATTGGGG GCACGTACCT GGGACCAGCC TTCCGTGGAC TGAAGGGGAG 2880 
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GACCCTCTAT 


CCCTCTGTAA 


GTGCTGTTTG 


GGGCCAGTGC 


CAGGTCCGCA' 


TCCGCTACAT 


2940 


GGGCGAAAGA 


AGAGGTGAGA 


TACGGACTAG 


GTGTGGGGAG 


ATCACTACTC 


TTGGCAATGG 


3000 


TTTGGGCTGG 


AAACTCATGG 


TTGGAGCACA 


GGAAGTAGGC 


TTCTTGTCAC 


TTTGGCCTGT 


3060 


CACTTAGATG 


GCCTTGGATC 


TAGCTTCACT 


CCCAATCCCT 


AT^GGATGTG 


ATGCACAAAT 


3120 


TCAGAGCCTT 


TGGGTCTCCC 


TCAGCTGAGG 


TGGCGGTGGA 


^AATGGAGGAA 


GAAGGAAGGG 


3180 


TGCCTGAGCA 


GGATCTCAAG 


TTCAAGGATG 


CCTGGAGTT)G 


CTTACTTACC 


TTGTCTTCCT 


3240 


TCTCTCTCCG 


CAGTGGAGGA 


ACCACAATCC 


CTTCTGCACC 


TGAGCCGCCT 


GTGTGTGCGC 


3300 


CATGCTCTGG 


GGGACACCCG 


GCTGGGTCAA 


atatc/actc 


TGCCTTTGCC 


CCCTGCCATG 


3360 


AAGCGCTATC 


TGCTCTACAA 


ATGACCCAGT 


agta'cagggt 


GTGCTGGCAC 


CCTACCGTGG 


3420 


GGACAGGTGG 


AGAGGCACCC 


GCTGGCCTAG 


a^aactttaa 


AAAGCTGGTG 


AAGCTGGGGG 


3480 


GGGGGGGCTG 


GACCCCTTCA 


CCTCCCCTTC> 


^TCACAGGAGC 


AAGACATATA 


GAAATGATAT 


3540 


TAAACACCAT 


GGCAGCCTGG 


GACAAAGAGG 


TTTTTGAAGT 


AAAAAATGAG 


ATGTATTGTC 


3600 


ACAACCTGTT 


TCATTATTGT 


TTTTTGOTTT 


GTTTTACACT 


CCCCCACCCC 


AGGCTAGAGC 


3660 


CCCATCACTG 


TCTTAAGGAA 


TTATGACAAC 


CCACAAAGCT 


CAGGCCCAGG 


TGTTTATTTC 


3720 


CCTTACATGT 


AGGATGGTTC 


ACASaCACAA 


TACAGGGGCT 


TTGGCACCGT 


GGGGGAGGGG 


3780 


ACTATCCCAG 


GCCTCTTAGG 


GTCTCATGTA 


TACCGAATTC 


AGACCCGAAA 


GCTCTGAATT 


3840 


TCTGCATCAG 


ACATCCAGTA 


GAACTTGGGA 


GTGAAGCTAG 


AGCCAAGGCC 


ATCTAAGTGA 


3900 


CAGGCCAAAG 


TGACACGAAGy 


'cCCACTTCCT 


GTGCTCCAAC 


CATGAGTTTC 


CAGCCCAAAC 


3960 


CAATGGAAGG 


TGATTTCAOT 


TGTCAGGGCC 


CAAAGGGACA 


GTCAGTTCTA 


CTCCCTCCCC 


4020 


TCACTAGGAG 


CCACCTTGGT 


GACAGTTGAT 


TCTACCCACT 


GTAAGTGGTA 


AAGGGATTGG 


4080 


CCTGGTCCCA 


ACCATAATAG 


GGCGGTGGAA 


ACGGCTCAGG 


AGGGTACAGC 


GTGGATTAGG 


4140 


CCACAAGATG 


GGGCAGATGA 


TGTCATCAGA 


AGCATGTGAC 


CGGTGGGAGC 


AGTTACTAAA 


4200 


CTTCTGGGCA 


ACCTAGTCCA 


TGCTATGCAG 


GCAGGTAGAG 


GGATGGGCAG 


TGCTCATTGT 


4260 


TTGGCATTGA 


TGATGTCCAC 


AAATTCAGGC 


TTGAGAGATG 


CGCCACCCAC 


AAGGAAGCCG 


4320 


TCCACGTCAG 


GCTGGCTTGC 


CAGCTCTTTG 


CAGGTTGCTC 


CAGTCACAGA 


ACCTGTACCA 


4380 


GGAACAAGAA 


/gacagtttgg 


TCAGGTCTAT 


GATCAGAACA 


CTTAAGCCCC 


ACCTCTCTGT 


4440 
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GCAAGGCAGC CTCAGTCTGT CTTAGCCCAT TTCCGTCTTA GCTAGAGCCA AAGCCACTCA 
CCTCCATAAA TGATCCGGGT GCTCTGAGCC ACCCCATCAT TGACATTGGA TTTCAGCCAT 
CCCCGGAGCT TCTCGTGTAC TTCCTGTGCC TAGAAGGAGG AGK3CAGAGCT ACTAAGTAAG 
CTCCTTCCTA TCTATCATTC AAGGAGTAAA AACCACTGGT /TCTCACATAG AGTTGAGTTT 
CCAGAAAAGC CCCGGGACCA GAGAGTGGCA AGGCTCCAAjT CCCACCAGGC TTGGAATGAA 
CATTTTTGGC AAAGTCACTC TCCTTGGTGA GTTTGGGGGC CCTCTGTCTC TAAAGGGGCT 
TGGATGGGCT CCATAGCTGT GTGAGTCTGT TAAAGQCGGA CAGGCTGAGG AGCTCTGGGT 
AGTTACCTGC TGAGGGGTTG CCGTCTTGCC AGTCGCAATG GCCCACACAG GTTCATAGGC 
CAGGACCACC TTGCTCCAGT CTTTCACATT ATCJTGTGGGG CAGAGAGGAG AGTGAGTAGG 
AAGGAGCTGA CCCGCCAAGC 
(2) INFORMATION FOR SEQ ID NO:46:y 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 264 amine/ acids 

(B) TYPE: amino acid/ 

(C) STRANDEDNESS : s/ngle 

(D) TOPOLOGY: line§ 

<ii) MOLECULE TYPE: DNA/ 

(xi) SEQUENCE DESCRIPT/EON: SEQ ID NO: 46: 

Met Gly Gin Thr Ala/Leu Ala Arg Gly Ser Ser Ser Thr Pro Thr Ser 
1 5 / 10 15 




Gin Ala Leu Tyr 
20 



25 



30 



Leu Ser Ala Pro 
35 



40 



45 



Asn Pro Lys Ai 
50 



55 



60 



Cys Phe Glu 
65 



Lys Arg Gw Tyr Ser Arg Gly Leu His Ala Trp Glu lie Ser Trp Pro 



85 



90 



95 



Leu Glu >Gln Arg Gly Thr His Ala Val Val Gly Val Ala Thr Ala Leu 



4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5000 



Asp Phe Ser Pro Pro Glu Gly Leu Glu Glu Leu 



hxo Pro Asp Leu Val Ala Gin Arg His His Gly Trp 



Cys Ser Glu Asn He Asp Val Lys Glu Gly Gly Leu 



/Arg Arg Pro Val Ala Gin Ser Thr Asp Gly Val Arg Gly 
70 75 80 



100 



105 



110 
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Ala Pro Leu Gin Ala Asp His Tyr Ala Ala Leu Leu Gly Ser Asn Ser 
115 120 / 125 



rys Lei 



Glu Ser Trp Gly Trp Asp lie Gly Arg Gly Lys Leu Tyr His Gin Ser 
130 135 / 140 

Lys Gly Leu Glu Ala Pro Gin Tyr Pro Ala Gly Pro Gin Gly Glu Gin 
145 150 / 155 160 

Leu Val Val Pro Glu Arg Leu Leu Val/Val Leu Asp Met Glu Glu Gly 

165 /l70 175 

Thr Leu Gly Tyr Ser lie Gly Gly Thr Tyr Leu Gly Pro Ala Phe Arg 
180 A.85 190 

Gly Leu Lys Gly Arg Thr Leu Tyr Pro Ser Val Ser Ala Val Trp Gly 
195 200 205 

Gin Cys Gin Val Arg lie Arg/Tyr Met Gly Glu Arg Arg Val Glu Glu 
210 21S 220 

/ 

Pro Gin Ser Leu Leu His Leu Ser Arg Leu Cys Val Arg His Ala Leu 
225 230 / 235 240 

Gly Asp Thr Arg Leu Glf Gin He Ser Thr Leu Pro Leu Pro Pro Ala 

245 / 250 255 



/ 



Met Lys Arg Tyr Leu /Leu Tyr Lys 
260 




(2) INFORMATION FOR SHQ ID NO: 47 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5615 base pairs 

(B) TYPE:/ nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

GTACTTTCTT TATATCTCCA TAATTTTATT TACTATTACT ACATGATACA TTATTTTATA 60 

AAAGTCTTTG T&ACCTCCTT AAGGATTCAC TGCTTAATCT CCAGTGCTTA GCACAAATCA 120 

ttaaatgcga/accagaaact CTTCCAAATG TGTTACATCT ATAACCTCAT TGGATTCTCA 180 

CTACCAACCC CATGCAATAG ATACTAATGT GATCTCTGTC TTACAGAGGA AGAAACAGGC 240 

ACAGGGA&GT TCAGTAATTT GCCCAAGGTC ATACACACAC TGGCCTTCAG GTATTCATGC 300 
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CCGGGGAGTC TGGTCCCACA GCTGGCATGT TTGCCATTAT 
TGTCGGCACT CATTAAGCAC ATTGACAGCT ATGCTTGGTG 
CTCTGTGCTA CATGCTTTAC CTGGATTATT TCAACTGCAC 
ACCATCATTG CTCCTATTTT ACATAACAGA AAACTACAGA 
GGCTCATGCC TGAAATCCCA GCACTTTGGG AGACCCTGTf 

/ 

GGCCGGACGT GGTGGCTCAC ACCTGTAATC TCAGCASTTT 
TCACAAGGTC AGGAGTTCTA GACCAGCCTG GCCA^ATGG 
AAATACAAAA AATAGCTAGG CGTGGTGGCA G^PGCCTGTA 
TGAGGCAGGA GAATCCCCTG AACCTGGGAG^ATGGAGGTTA 
GCTGCACTCC AGCCTGGGCA ACAAGAGCA& GACTCTGTCT 
ATAAAAATAT TTTTTTAAAA ATTAGC^GGG TGTGGTAGCA 
CTTGGGAGGC TGAGGTAGGA GGATOACTTG AGCCCAGGAG 
TGATGGCGCC ACTGCACTCT AGCCTTGGTG ACAGCAAGAC 
AAGAGAAATC GGGCAACTTC QECAAGATCG CGCAGTTAAC 
AACTCGAAGT CTTAATCAGG 7ACACTCTACC AAATGAGATC 
GCATCCAGTA TGAAGACTGG ACCAGCAGGG AGAACTATGA 
AAGCAGATTT CACAGCCTCA GAGGTGGCAC AGGCTGACTC 
ACCAGCCCAG AAACAGTGAC CCAGAATCAC AGGGAAGTAG 
AAGCCCCTCC TTGASCCCAT GCTCCTTACC CTCAGGGGCG 
GGCTCAAAGG TCOTGACGGT GGAGAACACC ATCCCCAGGG 
ATCflAAGCGT jLo» M„ CC„ 
GGGTTAACTG TCCCGGGCCT TCGCCGTGGG GGCGGGGCCT 
GACTGAGACy CGAGGTTAAC CGCCCGGGGT GGGCTCCACG 
CGGCTGCTGC CGGTATAGAG CGGTAACTGC CCAGGAGGGG 
GGCCTCGGAG CTGCACGGCC GTGGGCGGCG ATGAGAGGGT 
AGGGGCGGGG CCGCGGGACG GGCTCGGCCC AAGGGAGGAG 



ATTATAT^GC CTCCTTATAG 360 

AGTGACTACT ATGTACCCAG 420 

AAGAACCCTG TGAGGTAACT 480 

TCTGGGGC TGGGCGTAGT 540 

TCTAAAAAAA ATTTTTTTTT 600 

GGGAGGCTAA GGCAGGCAGA 660 

CAAAACCCTG TGTCTACTAA 720 

ATCCCAGCTA CTCAGGAGGC 780 

CAGAGAGCCG AGATCGTGCC 840 

CGAAAAAAAT AAAAATAAAA 900 

CATGCCTGTA GTCCCAGCTA 960 

GTCAAGGCTG CAGTGGGCTG 1020 

CCTGTCTCAA AAAAAAAAAA 1080 

TAGTGGCATA GCTTCACTCA 1140 

AACGGCTCAG TAATGGATTG 1200 

TGCGTACAGC CTAGAGCCTG 1260 

ACAACCCGGG GCAGAAAGGG 1320 

AAATGGGATT CGGCACAATG 1380 

CAGGAGTTAG TCGCTCAGGC 1440 

ATTCCCGACG CGGTGATGCC 1500 

ACTCTGCCGC AGCAAGAGAA 1560 

CGGGGAGGGT CACAGCCCGG 1620 

GGGGCGGGGC ATGCTCTCCG 1680 

GCGGGGCCCC ACAGGGGCGT 1740 

TAAGCCCCAG AGGGCCCTGG 1800 

CTGGGGGCGG AAGCGGCCGG 1860 
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CGGTCTGCGC CCTGCGCGCC TCGGCTTCTT TCCGCCCGGC TCCT'J'CAGAG GCCCGGCGAC 1920 

CTCCAGGGCT GGGAAGTCAA CCGAGGTTCG GGGGCAGCGG CQXGGGCTCC GGGCGAGTAA 1980 

GGGGGATGGT CCATGCTGAG GCCCAAATGG GGCGAACTCG/CGAGAGTCTC TGGCGACCTG 2040 

GATCAGATGG GGCGAGGGCA GATGAAGGGC CCAGGAGC^T TGGGGCAGCG AGGAGGGAGG 2100 

AGCGGGCCCG TTGGCAAACT TGGGTGAAAG GATGGG0TAC CTGGGTGACG AGCCCCCGCC 2160 

AGGATTCTGC TCTTCACGCC CCTTTTCTCC CAGCfcCCTT CCAGGTCAAT CCAAACTGGA 2220 

GCTCAACTTT CAGAAGAGAA AGACGCCCCA GC^AGCCTCT TTCGGGGAGT CCTCTAGCTC 2280 

CTCACCTCCA TGGGCCAGAC AGCTCTGGCA GGGGGCAGCA GCAGCACCCC CACGCCACAG 2340 

GCCCTGTACC CTGACCTCTC CTGTCCCGAt/ GGCTTGGAAG AGCTGCTGTC TGCACCCCCT 2400 

CCTGACCTGG GGGCCCAGCG GCGCCACG&T TGGAACCCCA AAGACTGTTC AGAGAACATC 2460 

GAGGTCAAGG AAGGAGGGTT GTACTTO'GAG CGGCGGCCCG TGGCCCAGAG CACTGATGGG 2520 

GCCCGGGGTA AGAGGGGCTA TTCAAGGGGC CTGCACGCCT GGGAGATCAG CTGGCCCCTA 2580 

GAGCAGAGGG GCACGCATGC CGJ^GGTGGGC GTGGCCACGG CCCTCGCCCC GCTGCAGACT 2640 

GACCACTACG CGGCGCTGCT QfiGCAGCAAC AGCGAGTCGT GGGGCTGGGA CATCGGGCGG 2700 

GGGAAGCTGT ACCATCAGAG /CAAGGGGCCC GGAGCCCCCC AGTATCCAGC GGGAACTCAG 2760 

GGTGAGCAGC TGGAGGTGCjC AGAGAGACTG CTGGTGGTTC TGGACATGGA GGAGGGAACT 2820 

CTGGGCTACG CTATTGGQ&G CACCTACCTG GGGCCAGCAT TCCGCGGACT GAAGGGCAGG 2880 

ACCCTCTATC CGGCAGtfAAG CGCTGTCTGG GGCCAGTGCC AGGTCCGCAT CCGCTACCTG 2940 

GGCGAAAGGA GAGGTjGAGGC CTGGGGCAGA CGTGGGGAGA ACTTTCTGTC CCTGGTGGCA 3000 

GTGGTTTGGG ATGQ&AACTC TTCTGACAAG AGCAGAGGGG ATGGACCTTC ATCCAGCCTG 3060 

CCTCAACCTC TGfTCAGTGC TGGGAAAGGC TAGGGGTCTT CACAGCTGTT ATTTAATTTA 3120 

ACCCAACAGC AATAGAGGTG AAACAGGCTT GAGAAAGCAA CTTTCTCAAG TTCTCTTGGC 3180 

CAGTAAATGG TGAACCTTCA GAATGGAGGG AGGAACTGCA GGGATGAGAG AATTCAGGAG 3240 

ATATCAACCc/cTGAGCAAGA GGTGCAAAGC GTTAGGTACT GGGTTTGATG TACAGGTCCA 3300 

AAAGAAGGAT GGGCAGAGCC AGGTACCCAG GCTGTATACC GGATTCCCTG GGCTCTAACC 3360 

TGTCTCTGTG CCACATACCT ACTTCCTTCC TCAGCCACAC CTCTGGATGG AGACACTGGG 3420 
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CCCCACCTCC ATGTCCGAGG GCTCAGTCTA GTCCTCAGCC CAC^CCACCT CAGCCGGGAA 
CCAAAGCCAC TCACCTCCAT AAATGATACG GGTGCTCTGA G^CACCGCAT CAGAGACGTT 
GGACTTCAGC CATCCTCGGA GCTTCTCGTG TACTTCCTGG GCCTAGAACA AGAAGCTGGC 
CTAAGTAAGA CCTTTTCTGC CTCTCTAAGA GGAAAAATCA CTGGCACCAG TGGACACTTA 
GTGTGGTTTC TGACTGAGTC AGAGTACCAG GGCT^TGATC CAAGCCAGGC CCTGGACTGG 
ATGCCCTTGG ACAAGTCACT GTCTCTGGGT TC/&GGTCTC TGTGTCTTTG AAATAAGGGG 



TTGCCCCATG TGGGCTGTGT CTGTCCAAAC STATTGAGGC AGGCTGGGAT GAGGGCAGGG 
CTCCTGGGCC CGGTTACCTG TTGGGGTGTT GCAGTCTTGC CAGTACCAAT GGCCCACACA 
GGCTCATAGG CCAGGACGAC CTTGCTCGAG TCCTTCACGT TATCTGCAGG GCAGAGATAC 
AGATGGAGGG AAGGGTGAAC AAGAA^GAGC TCTCCAGCCA GGTTCTCCGG AGTACGAAGA 
ACGGTGGCCT ACTGCCCCCT AGTGjSACATT GGGGG 
(2) INFORMATION FOR SEQ ID NO: 48: 

/ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: l>63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

<ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE/DESCRIPTION: SEQ ID NO: 48: 

Met Gly Gin' Thr Ala Leu Ala Gly Gly Ser Ser Ser Thr Pro Thr Pro 
1/5 10 15 

Gin Ala L,eu Tyr Pro Asp Leu Ser Cys Pro Glu Gly Leu Glu Glu Leu 
20 25 30 

Leu Ser/Ala Pro Pro Pro Asp Leu Gly Ala Gin Arg Arg His Gly Trp 
35 40 45 

Asn p/o Lys Asp Cys Ser Glu Asn lie Glu Val Lys Glu Gly Gly Leu 
5y0 55 60 

Tyr /Phe Glu Arg Arg Pro Val Ala Gin Ser Thr Asp Gly Ala Arg Gly 
65 / 70 75 80 

Arg Gly Tyr Ser Arg Gly Leu His Ala Trp Glu lie Ser Trp Pro 
85 90 95 



5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5615 
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Leu Glu Gin Arg Gly Thr His Ala Val Val Gly Val Rid Thr Ala Leu 
100 105 / 110 

Ala Pro Leu Gin Thr Asp His Tyr Ala Ala Leu Leu^ Gly Ser Asn Ser 
115 120 / 125 

Glu Ser Trp Gly Trp Asp lie Gly Arg Gly Lys/Leu Tyr His Gin Ser 
130 135 / 140 

Lys Gly Pro Gly Ala Pro Gin Tyr Pro Ala GQ.y Thr Gin Gly Glu Gin 
145 150 Ass 160 

Leu Glu Val Pro Glu Arg Leu Leu Val VsA. Leu Asp Met Glu Glu Gly 

165 2j70 175 

Thr Leu Gly Tyr Ala lie Gly Gly Thr/ Tyr Leu Gly Pro Ala Phe Arg 
180 186 190 

Gly Leu Lys Gly Arg Thr Leu Tyr yPro Ala Val Ser Ala Val Trp Gly 
195 200/ 205 

Gin Cys Gin Val Arg lie Arg T^r Leu Gly Glu Arg Arg Ala Glu Pro 
210 215 / 220 

His Ser Leu Leu His Leu Se/ Arg Leu Cys Val Arg His Asn Leu Gly 
225 230 / 235 240 

Asp Thr Arg Leu Gly Gin jval Ser Ala Leu Pro Leu Pro Pro Ala Met 



245 

Lys Arg Tyr Leu Leu 
260 



250 



255 



/r Gin 



(2) INFORMATION FOR SEQ ID NO: 49: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: &8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDfflDNESS : single 

(D) TOPOLOGY: linear 

( i i ) MOLECULE TYFE : DNA 

(xi) SEQUENCE? DESCRIPTION : SEQ ID NO: 49: 
AGCTAGATCT GGACpCTACA ATGGCAGC 
(2) INFORMATION FOR SEQ ID NO: 50: 
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(i) SEQUENCE CHARACTERISTICS: 
($) LENGTH: 36 base pairs 
\) TYPE: nucleic acid 




(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEC^IDNOiSO 
AGCTAGATCT GCCATCCTAC TCGAGGGGCC AGCTGG 




